MST224 
Mathematical methods 


Handbook 


The Open 


University 


MST224 
Mathematical methods 


Handbook 


The Open 
University 


This publication forms part of an Open University module. Details of this and other Open University 
modules can be obtained from Student Recruitment, The Open University, PO Box 197, Milton 
Keynes MK7 6BJ, United Kingdom (tel. +44 (0)300 303 5303; email general-enquiries@open.ac.uk). 


Alternatively, you may visit the Open University website at www.open.ac.uk where you can learn 
more about the wide range of modules and packs offered at all levels by The Open University. 


The Open University, Walton Hall, Milton Keynes, MK7 6AA. 
First published 2013. Second edition 2016. 
Copyright © 2013, 2016 The Open University 


All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, transmitted 
or utilised in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, without 
written permission from the publisher or a licence from the Copyright Licensing Agency Ltd. Details of such 
licences (for reprographic reproduction) may be obtained from the Copyright Licensing Agency Ltd, Saffron 
House, 6-10 Kirby Street, London EC1N 8TS (website www.cla.co.uk). 


Open University materials may also be made available in electronic formats for use by students of the 
University. All rights, including copyright and related rights and database rights, in electronic materials and 
their contents are owned by or licensed to The Open University, or otherwise used by The Open University as 
permitted by applicable law. 


In using electronic materials and their contents you agree that your use will be solely for the purposes of 
following an Open University course of study or otherwise as licensed by The Open University or its assigns. 


Except as permitted above you undertake not to copy, store in any medium (including electronic storage or 
use in a website), distribute, transmit or retransmit, broadcast, modify or show in public such electronic 
materials in whole or in part without the prior written consent of The Open University or in accordance with 
the Copyright, Designs and Patents Act 1988. 


Edited, designed and typeset by The Open University, using the Open University TRX System. 
Printed in the United Kingdom by Halstan & Co. Ltd, Amersham, Bucks. 


SUP 05024 3 
21. 


Contents 


Introduction 


1 


Notation 

1.1 Greek alphabet 
1.2 Symbols 

1.3. Limits and sums 


Numbers 
2.1 Real numbers 
2.2. Complex numbers 


Functions and graphs 

3.1 Functions 

3.2 Polynomials 

3.3 Exponentials and logarithms 

3.4 Graphs of some common functions 


Trigonometry 

4.1 Radians and degrees 

4.2 Trigonometric functions and their inverses 
4.3 Two useful triangles 

4.4 Trigonometric identities 

4.5 General sinusoidal functions 


Geometry 

5.1 Cartesian coordinates 
5.2 Polar coordinates 

5.3 Plane figures and curves 


Differentiation 

6.1 Notation and terminology 

6.2 Rules of differentiation 

6.3 Standard derivatives 

6.4 Stationary points 

6.5 Curve sketching 

6.6 ‘Taylor polynomials and series 


Integration 

7.1 Notation and terminology 
7.2 Rules of integration 

7.3 Standard integrals 


© Cc NID DD or ot OU 


ae 
re © 


PR RFR ee 
oP BW WwW WwW 


Pree 
NINO 


NOnNnNnwnee 
WNNrFrOWO © 


wNnww WN 
oo WwW f& 


Contents 


Contents 


Unit summaries 


U 


U 
U 
U 
U 
U 
U 
U 
U 
U 
U 
U 


nit 2 
nit 3 
nit 4 
nit 5 
nit 6 
nit 7 
nit 8 
nit 9 
nit 10 
nit 11 
nit 12 
nit 13 


Index 


First-order differential equations 
Second-order differential equations 
Vectors and matrices 
Linear algebra 
Systems of linear differential equations 
Functions of several variables 
Multiple integrals 
Differentiating scalar and vector fields 
Integrating scalar and vector fields 
Fourier series 
Partial differential equations 
Non-linear differential equations 


27 
at 
29 
a2 
Al 
46 
48 
52 
57 
60 
64 
67 
70 


74 


Introduction 


Introduction 


This handbook is a reference that you can take into the MST224 exam, 
and it may be valuable when you start to apply your knowledge in other 
modules. It will be more effective if you are already familiar with it before 
you sit the exam, and we suggest that you consult it when you attempt the 
assignments in the module. This handbook is not designed as a teaching 
document, and reading it is not a substitute for studying the module units. 


The first few sections consist of general mathematical reference material, 
largely based on the topics in Unit 1. It is not intended to be a 
self-contained or logically complete account of basic mathematics; it is just 
a set of definitions and results. Some of these results are used repeatedly in 
the module. You will also find some additional results or definitions that 
are not covered in Unit 1, but are useful to have in a mathematical 
reference booklet. 


The later sections of the handbook are brief summaries of the units, 
emphasising the most important results. 


1 Notation 


1.1 Greek alphabet 

a A alpha e I iota p P rho 

B B- beta kK K_ kappa o » sigma 
y T gamma » A lambda 7 T tau 

6 A delta uw M mu v Y upsilon 
€ E_ epsilon vy N nu @ ® phi 

¢ Z zeta £ x xy X chi 

7 H eta o O- omicron wy W psi 

9 © theta a II pi w Q omega 
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1.2 Symbols 


is equal to 

is not equal to 

is approximately equal to 
plus or minus 

minus or plus 

less than 

less than or equal to 
greater than 

greater than or equal to 
positive square root 

the number 2.718 28... 
the number 3.14159... 
infinity 

the integers 

the real numbers 

the complex numbers 


REG I A ATH 0 tK II 


1.3. Limits and sums 


An ordered list of numbers xo, 71, 22,...,2n,--. iS said to converge to the 
? ? “4 i ? 
limit x if successive terms in the ordered list are better and better 
approximations to x. We write ‘rz, > 7 as n> oo’ or‘ lim rp, = 2’. 
nN—-0o 


Given numbers @1, @2,...,@n, we define 


n 
y a; tomean ayt+agt+-::+dn_1 +n. 
i=1 


2 Numbers 


2.1 Real numbers 


The integers are the positive and negative whole numbers, together with 
zero. Non-integer numbers that can be expressed exactly as fractions are 
called rational numbers; those that cannot be so expressed, such as V2, 
e and 7, are called irrational. The collection of the rational numbers 
(including the integers) and the irrational numbers is called the set of real 
numbers. 


When a real number is expressed in decimal notation, if it is approximated 
then the approximation can be given to so many decimal places, or to so 
many significant figures. For example, 1.4142 is the approximation to 
/2 to four decimal places and five significant figures, while 0.000 000 342 is 
given to nine decimal places but three significant figures, and 342000 has 
no decimal places but at least three significant figures. The process of 


2 Numbers 


reducing the number of decimal places or significant figures to which a 
number is expressed is referred to as rounding. To round a given number 
to n decimal places or n significant figures, take the number expressed to 
n decimal places or n significant figures that is closest to the given 
number, where it is conventional to round the digit 5 up. (Be aware that 
other conventions for rounding exist, and that computer programs and 
calculators do not always use this convention.) 


A number given in the form +b x 10°, where 1 < b < 10 and © is an 
integer, is said to be in scientific notation. For example, the number 
342 000 can be expressed as 3.42 x 10° in scientific notation. 


For any two real numbers a and 6 with a < b, we write: 


[a, 


a, 6] for the set of all real numbers x such that a < x < b; 
(a,b) for the set of all real numbers x such that a < x < b; 

(a, b| for the set of all real numbers x such that a < x < b; 

(a,b) for the set of all real numbers x such that a < a < b. 
These sets of numbers are called intervals. The interval |a, | is a closed 
interval, (a,b) is an open interval, and the two others are half-open 
intervals. For practical problems the distinction between open and closed 
intervals is rarely significant. If we know that a real number « is 1.274 to 
three decimal places, then z lies in the interval [1.2735, 1.2745). 


2.2 Complex numbers 


A complex number z is written in Cartesian form as z = a+ bi, where 
a and b are real numbers, and i? = —1. We refer to a as the real part 
of z, written Re(z), and to b as the imaginary part of z, written Im(z). 


Complex numbers can be added, e.g. 
(a+ bi) + (e+ di) =(a+c)4+(b4+d)i, 
or multiplied, e.g. 
(a+ bi)(c + di) = (ac — bd) + (ad + bc)i. 
These formulas make use of the ordinary rules of algebra, together with 
the relation i? = —1. 


The complex conjugate of z=a+ bi is Z7=a-— bi. Note that The notation z* is also used in 


= ' : 2 2 some texts for the complex 
zz = (a+ bi)(a— bi) =a°+b conjugate of z. 


is a positive real number (unless a = b = 0). The modulus of z is the z and z* are read as ‘z bar’ and 


number |z| = /zz = Va* + 0?. ‘z star’. 


To calculate a quotient of complex numbers, multiply top and bottom by 
the complex conjugate of the bottom, e.g. 


a+bi  (a+bi)(c—di) _ fac+bd be — ad ‘ 
c+di (ce+di)(c—di) \e+d c+ d2)° 
which expresses the quotient in the form p+ qi with p and q real. 
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Polar coordinates are discussed 
in Subsection 5.2. 


The Argand diagram is a representation of complex numbers as points 
in a plane, where the complex number a + bi is represented by the point 
with Cartesian coordinates (a,b). A point can also be represented in polar 
coordinates as (r,@) and related to its Cartesian coordinates by 


a=rcos@, b=rsiné. 


The angle coordinate @ is referred to as an argument of z, written arg(z), 
and the unique value of # in the range —7 < 6 < 7 is referred to as the 
principal value of the argument, written Arg(z). 


The multiplication of complex numbers in polar form is given by the rule 
(r,0) x (s, 6) = (rs,8+ ¢). So powers of a complex number can be 
expressed as (r,09)” = (r",n6), for n a positive integer. The special case 
when r = 1 is known as de Moivre’s theorem: 

(cos 6+ isin 8)” = cos(n@) + isin(né). 


Euler’s formula is 


0 


e” =cos6+ isin#. 


This extends to any complex number a+ bi as 


ao bt — (cos b + isinb). 


=e 
The exponential form of a complex number z = r(cos@ + isin @) is 
Z=7e", 


This form is useful for multiplying, dividing and taking powers of complex 
numbers. 


3 Functions and graphs 


3.1 Functions 


A variable is a quantity, represented by a symbol, that can vary over a set 
of values. If its value does not vary, then it is a constant. 


Any expression or formula that involves a variable x, and whose value is 
uniquely determined by the value of x, is called a function of x. 


If a variable y is a function of x (i.e. if y is equal to a function of x), then 
we call xz the independent variable and y the dependent variable, and 
we may write y = y(). Here y(a) stands for the function of x (i.e. for the 
formula involving x). 


If f and g are two functions, then their sum is a function f + g defined by 


(f+9)(@) = f(@) +9(@) (for all x). 


Moreover, if A and B are any two numbers, then the function Af + Bg is 
defined by 


(Af + Bg)(z)=Af(x)+ Bag(x) (for all x). 


3 Functions and graphs 


The function f(g(a)) is called the composite function or composition 
of the functions f and g. 


The graph of a function f(x) is the curve in the xy-plane whose equation 
is y = f(z). 


A continuous function is one whose graph has no breaks or jumps in it, 
ie. it can be drawn without lifting your pen from the paper. 


A constant function f(z) is one that assigns the same value to any 
input. Its graph is a straight line parallel to the z-axis. A special case is 
the zero function, which assigns the value 0 to any input. 


A linear function is one having the form a;x + ao (with a; 4 0), where 
a, and ag are constants. Its graph is a straight line with slope a; and 
y-intercept ao. 


A quadratic function is one having the form ag? + a1x2 + ao (with 

ag # 0), where ag, a1, a9 are constants. Its graph is a parabola, similar in 
shape to the one shown in Subsection 5.3 if a2 > 0, but the other way up if 
ag < 0. 


A cubic function is one having the form a3x? + a2x? + a,x + ag (with 
a3 # 0), where a3, a2, a1,@9 are constants. 


3.2 Polynomials 


Linear, quadratic and cubic functions are all particular examples of 
polynomial functions, or simply polynomials. An nth-order polynomial, 
or polynomial of degree n, is a function of the form 


-1 
Ant” + ann" +--+ + a,x + a0, 


where n is a positive integer, x is a variable and ao,a1,...,a@n are 
constants with a, 4 0. A linear polynomial has n = 1 (orn = 0), a 
quadratic polynomial has n = 2, and a cubic polynomial has n = 3. 


The roots of a polynomial p(x) are the solutions of the equation p(x) = 0. 
Every polynomial of degree n can be written as a product of a, and 

n factors of the form x — c, (k= 1,2,...,n), with each cz, a complex 
number (which may be real). Each of these factors corresponds to a root 

x = cy of the polynomial. If a factor x — c occurs more than once, then the 
root x = c is a repeated root; repeated roots are also sometimes referred 
to as equal roots or coincident roots. 


The roots of a quadratic equation az” + br +c = 0, a £ 0, are given by the 
formula method as 


—b aE. b2 — 4ac 
2a , 


The quantity b? — 4ac is referred to as the discriminant of the quadratic 
equation. 
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Some texts refer to y = ba” as 
an exponential function; this is 
not to be confused with the 
exponential function, exp x. 


10 


To factorise a polynomial is to express it as a product of two or more 


polynomials of lower degree. For example, the difference of two squares 


x? — a? factorises as 


zg’ —a? = («# —a)(x +a), 


and the perfect square x? + 2ax + a? factorises as 


az” + 2an +a" = (2 + a)(2+a) = (x +20)". 


3.3. Exponentials and logarithms 


A function of the form y = ba”, where a and b are non-zero constants (with 
a>0O and a#1), and z is real, is said to exhibit exponential 
behaviour. In a”, a is referred to as the base and x as the exponent (or 
index or power). Properties of such a function include 

a? = 1, 

a * =1/a", 

a® x a? =a", 

eig=n°*, 

ah =e" = (ar), 
The function e*, where e = 2.718 28..., is referred to as the exponential 
function. It is also written as exp. 


The natural logarithm function In is defined to be the inverse 
function of the exponential function exp, i.e. each reverses the effect of 
the other, so that 

In(expz) =a for all real z, 

exp(Inz) = 2 for all real x > 0. 
In other words, if e¥ = x, then y = Inz, and vice versa. 
The natural logarithm function In x, for x > 0, has the properties 

Inl =0, 

In(1/z) = —Inz, 

In(zy)=Ing+Iny, y>0O, 

In(a/y) =Inz—Iny, y>O, 

Ing’ = ylnz. 
Any function y = ba® can be written in the form y = be**, where k = Ina. 


Another logarithm function is log;g 2, for z > 0, where y = logy, x if 
10¥ = x (and vice versa). The properties given above for In also hold for 
logio- 


Although they are not used in this module, we note for general reference 
purposes that the hyperbolic functions sinh x, coshz and tanh are 
defined as combinations of exponential functions: 
sinh x = $(e” —e~*), 
cosh z = $(e* +e7*), 
sinha e*—e* 


tanh z = — : 
cosha e*+e-% 


The inverses of these functions are 
arcsinh z = In(x + Va22+1), 
arccoshz =In(r+ Va2?-1), 2>1, 


1 
arctanh 7 = 4m (+=) « wel< i. 
—2£ 


3.4 Graphs of some common functions 


3 Functions and graphs 
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4 Trigonometry 


4.1 Radians and degrees 


In this module we usually measure angles in radians rather than degrees. 
There are 27 radians in a full circle, corresponding to 360°, so 1 radian is 
(180/7)° =~ 57°. An advantage of working in radians is the simplicity of the 
formula for the arc length subtended by an angle in a circle of radius r: 
the length of the arc subtended by an angle of 6 radians is simply r@é. The 
following radian measures of standard angles are worth knowing: 

e aright angle is > radians 


e the angles of an equilateral triangle are 3 radians. 


An angle is acute if its radian measure lies between 0 and 5, and obtuse 
if its radian measure lies between 5 and 7. 


4.2 Trigonometric functions and their inverses 


For an acute angle 0, the values of trigonometric functions are related to 
the ratios of lengths of the sides of a triangle as follows. 


Function Definition for acute angles Definition in terms 


in terms of triangle shown of sin and cos 

sin @ = 

Cc 

b 
cos 0 = 

Cc 

a sin 0 
tan 0 = 

b cos 0 

b 6 
cot 4 - ti 

a sin 8 

c 1 
sec 0 = 

b cos @ 

1 

cosec 6 = - 

a sin 0 


4 Trigonometry 
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Inverse function Definition 


arcsin © = 6 where sind=2 
arccos © = 6 where cosd = 2 
arctan x = 6 where tand=2z 
arccot x = 6 where cotd=2z 
arcsec © = 6 where sec9 = x 


arccosec x = 6 where cosec@ = x 


4.3 Two useful triangles 


From the two triangles in the margin it can be seen that: 


a | n_ v3 ae 2 
sing =3, cosg => tan ¢ = ap 
x Ze 2 zt . 
cosec ¢ = 2, sec G = Fy) cot 2 = V3; 
sin= = + cos= = + tan 4 = 
“ 4 /2? - 4 4/2? A ee 
cosec 7 = V2, sec 4 = V2, cot 7 = 1; 
nt = ¥3 nl — 
sing = >, cos F = 5, tan = V3, 
ea ZH ie 
coset 3 = Ty, sec 3 = 2, cot 3 = 7 


Other values of the trigonometric functions worth remembering are: 


sin0 = 0, cos0 = 1, tan 0 = 0; 
* — a ™ 

sins = 1, cos 5 = 0; 

sin 7 = 0, cos7 = —l, tan7 = 0; 
a 3m 

sin = = —1, cos = = 0. 


4.4 Trigonometric identities 


Pythagoras’s theorem states that for any right-angled triangle, if c is 
the length of the hypotenuse (the side opposite the right angle) and a and 
b are the lengths of the other two sides, then 


=a? +b’. 
This leads to the following trigonometric identities: 
sin? 6 + cos? @ = 1, 
tan? 6+ 1 = sec? 0, 
1 + cot? 6 = cosec? 6. 


Addition formulas 
sin(a + 3) = sinacos 3 + cosasin B, 
sin(a — 3) = sinacos 8 — cosasin B, 
cos(a + 8) = cosacos 8 — sinasin G, 
cos(a@ — 3) = cosacos 8+ sinasin £, 
tana + tan 3 


t acc a ae 
alae P) 1—tanatan f’ 


tan a — tan § 
iio 
enh y) 1+ tanatan p’ 
sin acos § = $sin(a + f) + $sin(a — £), 
cos asin 3 = 4 sin(a + 8) — Fsin(a — f), 
cos a cos 8 = $cos(a + 8) + ¥c0s(a — 8), 
sin asin B = $cos(a — 8) — $ cos(a + f). 


In particular, these formulas give 


sin(a+ 27) =sina, cos(a+2m)=cosa, tan(a+7) = tana; 


sin(—a) =—sina,  cos(—a) = cosa, tan(—a) = — tana. 


Double-angle formulas 


sin 2a = 2sinacosa, 

cos 2a = cos? a — sin? 
2 tana 

tan 2a = ———z—, 
1 — tan-a 

sin? a = 3(1 — cos 2a) 

=5 : 


cos” a = $(1 + cos 2a). 


Cosines of related angles 
cos($—a)=sina,  cos($+a)=-—sina, 


cos(7 — a) = —cosa, cos(7+a) = —cosa. 


4.5 General sinusoidal functions 


A sinusoidal function or sinusoid is a function x(t) of time t of the form 


x= 29+ Acos(ut + ¢) = 29+ Asin(wt+y), 


where xg is a constant, A is a positive constant called the amplitude, w is 
a positive constant called the angular frequency, and ¢ and w are 


constants called phase constants. 


a= 1 — 2sin? a = 2cos* a — 1, 


4 Trigonometry 
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A sinusoidal function oscillates between x) — A and x9 + A, repeating the 
same pattern of oscillations through each time interval of length 27/w, 
known as the period of the function. For these reasons, sinusoidal 
functions are examples of oscillatory functions and of periodic functions. 


The phase constants in the two forms of the sinusoidal function are related 
according to ¢ = = +w (or equivalently ¢ = w— 4). 


Alternative forms of the sinusoidal functions are given by 


x= 2X9 + Acos(wt + d) = xp + Bcos(wt) + Csin(wt), (1) 
x= 29+ Asin(wt + w) = xo + Dsin(ut) + E cos(wt). (2) 


In equation (1) we have B = Acos¢ and C = —Asin@. Solving these 
equations gives A = VB? + C? and ¢ = arccos(B/A). 


In equation (2) we have D = Acos7 and EF = Asiny. Solving these 
equations gives A = VD? + EF? and 4 = arccos(D/A). 


5 Geometry 


5.1 Cartesian coordinates 


The Cartesian coordinates (z, y) of a point P in a plane specify the 
position of that point relative to two perpendicular axes, the x-axis (or 
horizontal axis) and y-axis (or vertical axis), which meet at a point O 
called the origin, with Cartesian coordinates (0,0). The directions of the 
axes indicate increasing numerical values for the z- and y-coordinates. 
Values of x to the right of the y-axis are positive, and those to the left are 
negative; similarly, values of y above the x-axis are positive, and those 
below are negative. The four parts into which a plane is divided by 
Cartesian coordinate axes are known as quadrants of the plane. A plane 
on which Cartesian coordinate axes have been specified is often referred to 
as the xy-plane. 


YA 
YA 
P Second quadrant | First quadrant 
OLr,Yy 
y-axis —_| (x,y) z<0 x>O0 
y>0 y>0 
~ O x 
origin zr<O0 zr>0 
O bs y <0 y <0 
L-AXIS Third quadrant | Fourth quadrant 


5.2 Polar coordinates 


The point P whose polar coordinates are (r,@) has Cartesian coordinates 
(x,y) where 


x=rcos), y=rsiné. 


The value of r is always positive (except at the origin, where it is zero). 
For a given point P, the value of # is not unique: we can add or subtract 
any integer multiple of 27 and obtain another value for @ that describes the 
same point. The value of 6 satisfying —a < 6 < 7 is called the principal 
value of 6. 


5.3 Plane figures and curves 


A closed plane figure with straight sides is called a polygon. A polygon 
with 3 sides is a triangle, one with 4 sides is a quadrilateral, one with 
5 sides is a pentagon, one with 6 sides is a hexagon, and in general one 
with n sides is called an n-gon. 


A polygon is said to be regular if all its sides have equal length and all its 
angles are equal. A regular triangle is referred to as an equilateral 
triangle, and a regular quadrilateral is a square. 


An isosceles triangle is one with two sides of equal length (or 
equivalently with two equal angles). A right-angled triangle is one in 
which one angle is a right angle. The angle sum of a triangle is 

m radians (180°). 


A parallelogram is a quadrilateral with opposite sides parallel. 

A rectangle is a parallelogram all of whose angles are right angles. 

A square is a rectangle all of whose sides have equal length. The angle 
sum of a quadrilateral is 27 radians (360°). 


The area of a triangle = 5 x base x height. 


The area of a parallelogram = base x height. 


height height 


base base 


The areas of a triangle and parallelogram 


5 Geometry 


The relationship between 
polar and Cartesian 
coordinates 
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A circle is a set of points in a plane that are a constant distance from a 
fixed point in the plane. The fixed point is the centre of the circle, and 
the constant distance is its radius. If a straight line cuts a circle at two 
points, then the segment of that straight line within the circle is known as 
a chord of the circle. The length of a chord that passes through the centre 
of a circle is the diameter of the circle. The terms diameter and radius 
are also used to refer to a chord through the centre of a circle and to a 
straight line from a point on the circle to its centre, respectively. 


radius 


arc 


angle subtended 
by arc 


Each continuous segment of a circle is known as an arc of the circle; the 
angle made at the centre of a circle by two radii drawn from the ends of the 
arc is known as the angle subtended by the arc. (In such circumstances 
the arc itself is sometimes called the arc subtended by the angle.) 


The distance around a circle is known as its circumference, and for a 
circle of radius r this is given by 2a7r. The area of a circle of radius r 
is mr?. The arc length of an arc of a circle of radius r subtended by an 
angle @ radians is r?. The area of a sector of a circle of radius r 
subtended by an angle @ radians is $r70. 


The equation of a circle in the xy-plane with centre (a,b) and radius r 
is (x — a)? + (y— 6)? = r?. Other curves in the ry-plane that can be 
represented by quadratic formulas are the ellipse, the parabola and the 
hyperbola, examples of which are shown below. 


COL )¢ 


Ellipse Parabola Hyperbola 


A straight line between two distinct points on a curve is known as a chord 
of the curve. A straight line that just touches a curve is known as a 
tangent to that curve at the point where it touches. 


6 Differentiation 
6 Differentiation 


6.1 Notation and terminology 


If f(x) is a function, then its derived function or derivative f’(x) is 
defined by 


f(x +h) — f(a) 


/ oe 
ie= ie h 


The process of calculating f’(x) from f(z) is called differentiation of 
f(x) with respect to a. Differentiation with respect to x can also be 


d 
denoted by the symbol as written to the left of the expression or variable 
x 


being differentiated, so that, for example, 


d df (x) , 
anf) and oe both mean f'(z). 
If y = f(x), then f’(x) can also be written as y’ or aw where to save space 
z 
d 
we often print dy/dx in place of a 


When the independent variable is t (time), we often use a dot to indicate a 
derivative, so that w means the same thing as u’(t) or du/dt. If x is the 
position of a particle along an axis, then dx/dt or & represents the 
velocity along the axis, and d?x/dt? or % represents the acceleration 
along the axis. 


The notation f’(z) is referred to as function notation, dy/dx as Leibniz 
notation, and u as Newton’s or Newtonian notation. 


The derivative of a derivative is called a second derivative. For example, 
the second derivative of the function f(x), denoted by f(z), is the 
derivative of f’(x). If y= f(x), then the second derivative is also written 
as y" or d?y/da?. If u is a function of t, then its second derivative can be 
written as w. 

Third and higher derivatives are defined analogously. The nth 
derivative of f is denoted by f\” or, if y = f(x), by d"y/dx”; n is referred 
to as the order of the derivative. The prime and dot notations are not 


used for higher derivatives, except that f”” is sometimes used in place 
of f®). 


A complex-valued function f(x) = g(x) + ih(a), where g and h are 
real-valued functions, can be differentiated in a natural way as 


f'(a) = g(a) + th'(a). 
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6.2 Rules of differentiation 


Constant multiple rule: If k is a constant and uw is a function of xz, then 
du 


(ku)! = ku’, or equivalently (eu =k ae 


Sum rule: If u and v are functions of x, then 


d d d 
(u+v) =u’ +’, or equivalently —(u+v) = Lani il 
dx dx dx 
Product and quotient rules: If wu and v are functions of z, then 
d du du 
(uv) =u'v+uv'’, or equivalently —(uv) = —v+u—, 
dx dx dx 
and 
Gi u'v — uv’ catanil d () du du j 3 
—} = —,—.,_ or equivalen —(-)=(|[—v-u— ]} /v’. 
v v2 a Y Ga\o dx dx 
This is sometimes called the Composite rule or chain rule: If g and u are two functions, and 


‘function of a function’ rule. h(a) = g(u(a)), then 


hi(a) = g'(u(z)) u'(2). 
Another way to write this is as 


dh _ dh du 

dx du dx 
Implicit differentiation: Given an equation connecting two variables x 
and y, we can use implicit differentiation to calculate dy/dx by 
differentiating both sides with respect to « and then solving the resulting 
equation algebraically for dy/dx (instead of solving for y before 
differentiating). For example, 

dy 


d 3,2 2.2 3 
= = 3277? + Qa y —2. 
mt Y) wy” + ary 


20 


6.3 Standard derivatives 


In each case, a is a constant. 


Function 


In(ax) 


sin(ax) 
cos(az) 
tan(ax) 
cot (ax) 
sec(ax) 
cosec(az) 


arcsin (az) 
arccos(ax) 
arctan(az) 
arccot(axr) 
arcsec(a2:) 


arccosec(ax) 


Derivative 


a cos(ax) 

—asin(ax) 

asec? (ax) 

—acosec?(az) 

asec(ax) tan(az) 

—acosec(ax) cot(ax) 
a 


V1 — ax? 


a 


1 — a?x? 
a 
1+ a2x? 
a 
1 +422? 


a 
lax|\Va2x? — 1 
a 
lax|Va2x? — 1 


6 Differentiation 
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6.4 Stationary points 


The gradient of a function f at a point xo is the slope of the tangent to 
the graph of f at that point, and is given by the derivative of f at that 
point, i.e. f’(zo). A function is increasing on an interval if its gradient is 
positive throughout that interval; it is decreasing if its gradient is 
negative throughout that interval. A stationary point of f is a point x 
where the gradient is zero, i.e. f’(xo) = 0. 


A function is smooth if it is continuous and has a continuous derivative. 
Consider a stationary point x9 of a smooth function f. 


e 2p isa local maximum if, for all z in the immediate vicinity of xo, 
f'(x) > Oif x < zo and f’(x) < Oif x > zo. An alternative condition is 
f'(ap) <0. 

e 2x9 isalocal minimum if, for all x in the immediate vicinity of xo, 
f(x) < Oif x < xp and f’(x) > 0 if « > zg. An alternative condition is 
f"(xo) > 0. 

e 29 isa point of inflection if f(z) = 0 and f”(x) changes sign as x 
increases through zo. 

A global maximum of a function f is a point xo such that f(xo) > f(x) 

for all x where f is defined. A global minimum of a function f is a 

point zo such that f(ao) < f(x) for all x where f is defined. A function f 

is bounded above by an upper bound A if f(a) < A for all « where f 

is defined. A function f is bounded below by a lower bound B if 

f(x) => B for all « where f is defined. 


6.5 Curve sketching 
The following is a possible procedure for sketching the graph of y = f(x), 
where f(x) is some given function. 


1. Check whether f(x) is a standard function whose graph you already 
know, or is a simple modification of such a function. If not, proceed to 
Step 2. 


2. Determine how y behaves when x is very large and positive, and when 
x is very large and negative. 


Look for any obvious symmetries or repetitions in the behaviour of f. 
Find where the curve crosses the z- and y-axes, if at all. 


Look for any values of x at which f(a) is undefined, and examine the 
behaviour of f(x) near these values of x. 

6. Find the locations of any local maxima, local minima or points of 
inflection. 

7. Try to determine whether there are any intervals over which the 
function is increasing or decreasing. 


6 Differentiation 


8. Transfer the information found in Steps 4 and 6 to a sketch graph, 
then use this information together with any information found in 
Steps 2, 3, 5 and 7 to try to sketch a smooth curve. If you are still 
unsure about any parts of the curve, choose suitable values of « and 
plot the corresponding points (x, f(a)) before completing the curve. 


Lines y = c where f(x) + cas x — too, and lines x = c where 
f(x) — +00 as x > c, where c is a constant, are known as asymptotes of 
the graph of the function f(z). 


6.6 Taylor polynomials and series 


Factorials 
For any positive integer n, we define n factorial, written n!, by 
ml=1x2x3x---x (n-1) xn. 


The first few factorials are 1! = 1, 2! = 2, 3! = 6, 4! = 24. We also define 
O! = 1. 


Taylor polynomials 


For a function f(«) with n continuous derivatives near x = a, the Taylor 
polynomial of degree n about + = a or the nth-order Taylor 
polynomial about x = a is 


pn(t) = f(a) + (© — a) f(a) + yo — a)? f"(a) +0 +(e a)" F(A), 


When used to approximate f(z) near x = a, we refer to this polynomial as 
the nth-order Taylor approximation to f(x) near « = a, and write 


1 1 
f(z) = fa) +@-afi@+F@- ay fa) ene l(t a)” f™ (a). 
In particular, n = 1 gives the tangent approximation 


f(a) ~ f(a) + (w— a) f'(a), 


and n = 2 gives the quadratic approximation 


F(z) = f(a) + (@ - a) f'(a) + 3@ — 0)’ F"(a). 


These approximations are good when z is close to a. 


Taylor series 


The Taylor series about x = a for a function f(x) with infinitely many 
continuous derivatives near x = a is 


f(a) = fla) + (@— a) f"(a) + @ — a)? f"(a) + 
+ “(2 ale” (a) 4+ 


= Sle a) Ma. 
n=0 
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Some standard Taylor series about xz = 0 


‘ — 1 3 1 5 n-1 1 2n-1 
pane eae es ested) Qn—D!” bo 
cost = 1— 5a + ae —---+(-1) Gn)” free, 


x 1 2 1 n 
Si et oe ter ae bg 
2! n! 


1 1 1 
a) Sesser er ake (—] < @ = 1), 


Small-angle approximations 


If the angle # is small (sometimes written 9 < 1) and is measured in 
radians, then we can obtain useful approximations by truncating the above 
Taylor series: 


¢ Integration 


7.1 Notation and terminology 


The indefinite integral of a continuous function f(z) is 


[f@a= Fate 


where F is a function such that F’(x) = f(x), known as an integral or 
antiderivative of f, and C is a constant, often referred to as an 
arbitrary constant or constant of integration. 


The definite integral of a continuous function f(a) from a to 6 is 


b 
[ f@ae= [Flt = FO) - Flo) 


where F' is any integral of f. The numbers a and 6 are called the lower 
limit of integration and upper limit of integration, respectively. If 
the areas bounded by the graph of f(a) above and below the x-axis 
between a and b are A, and Ag, respectively, then 


b 
/ f(z) = Ay = Ap. 
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The process of finding an indefinite or definite integral is known as 
integration, and the function f being integrated is known as the 
integrand. If F, and F> are two integrals of f, then they differ by a 
constant, ie. Fy (x) = Fo(x) + C, where C is a constant. 


7.2 Rules of integration 


Constant multiple rule: 


frt@ n= kf He) dx (where k is a constant). 


Sum rule: 

fue + g(x)) dx = [i@ dx + f oe) dx. 
Integration by substitution: 

| fo@ olde = [te )du (where u = g(z)), 
or in Leibniz notation 


[to Zar= [ ryau 


The following formula, which can be derived by integration by 
substitution, is also useful: 


[a | de tals )}+C (where g(x) # 0). 
g(x) 


Integration by parts: 


[f@ae)ae= Fe) ate) f Feo 


For definite integrals, 


[s@r@e=@ow R- f reo 


A function f is odd if f(—x) = —f(x). If f is odd and a is a positive 
constant, then 


” f(a) de = 


7.3 Standard integrals 


In each case, a is a non-zero constant, b is any constant, and n is any 
integer. When using the table below to obtain indefinite integrals, add an 
arbitrary constant. 
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Function 


a 


x® (a # —1) 


cosec(az:) 
sec? (ax) 
cosec? (ax) 
x sin(ax) 


x cos(ax) 


i 
xv? + a2 


Integral 


1 
| : 
= n |cos(ax)| 
* In |sin(az)| 
— In |sin(ax 
a 
1 
— In |sec(ax) + tan(az)| 
a 
1 
= In |cosec(a2z:) — cot(ax)| 


1 
—t 
- an(az) 


(sin(ax) — ax cos(az)) 


(cos(ax) + ax sin(az)) 


1 L 

— arctan (=) 
a a 
x—b 
In(x + Va? + a?) 
In|a + Vx? — a? | 


: 4b 
arcsin | — 
a 


Unit 2 summary 


Unit summaries 


Unit 1 Getting started 


The material in Unit 1 is summarised in Sections 1-7 of this Handbook. 


Unit 2 First-order differential equations 


is 


A differential equation is an equation that relates an independent 
variable x, a dependent variable y, and one or more derivatives of y. 
Its order is the order of the highest derivative that appears. So a 
first-order differential equation for y = y(x) contains its first 
derivative, dy/dx, but no higher derivative of y. This unit considers 
only first-order differential equations that can be expressed in the form 


d 
=. = fle), 


where f(x,y) stands for some expression that may contain either or 
both of the variables x and y. 


A solution of a differential equation is a function y = y(x) that 
satisfies it. A solution that is written in the form ‘y = function of x’ is 
an explicit solution; otherwise it is an implicit solution, i.e. an 
equation of the form F(z, y) = 0 for some function F. 


The general solution of a differential equation is the collection of all 
of its solutions. It is usually possible to give the general solution of a 
first-order differential equation as a formula containing one arbitrary 
constant. A particular solution of a differential equation is a single 
solution containing no arbitrary constant. 


An initial condition for a first-order differential equation 

dy/dx = f(x,y) is an assignment of a value yo that the dependent 
variable y must take when the independent variable z takes some 
given value x9. An initial condition may be specified in the form 
‘y = yo when x = 2’ or ‘y(%o) = yo’; Zo and yo are referred to as 
initial values. 


An initial-value problem is to find the particular solution of a 
differential equation that satisfies a given initial condition. Given the 
general solution of a differential equation involving an arbitrary 
constant C’, we can determine the solution satisfying an initial 
condition by substituting the initial values into the general solution; 
this gives an equation from which the required value of C' can, in 
principle, be found. 
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5. 


Some differential equations have an analytic solution, i.e. an explicit 
general solution derived using calculus. To decide whether an equation 
of the form dy/dx = f(x,y) may be solved analytically by one of the 
methods described in the module, proceed as follows. 


(a) If f(x,y) is independent of y, so that f(x,y) = f(x), then the 
equation may be solved by direct integration: its general 
solution is y = F(x) + C, where F(z) is an integral of f(x) and C 
is an arbitrary constant. 

(b) If f(x,y) has the form f(#,y) = g(x) h(y), then use the method of 
separation of variables described below. 


(c) If f(x,y) has the form f(x,y) = h(x) — g(x) y (so that the 
equation is of the form dy/dx + g(x) y = h(x)), then the equation 
is linear, and may be solved by the integrating factor method 
described below. 


To solve the differential equation dy/dx = g(x) h(y), where h(y) 4 0, 
using the method of separation of variables, proceed as follows. 


(a) Divide both sides of the equation by h(y), and integrate with 
respect to xz, to obtain 


fans [o@ae 


(b) If possible, perform the two integrations, obtaining an implicit 
form of the general solution, which should include one arbitrary 
constant. 


(c) If possible, rearrange the formula found in Step (b) to give y in 
terms of x; this is the explicit general solution of the differential 
equation. In addition, there may be supplementary solutions if y 
is a constant satisfying h(y) = 0. 


A first-order differential equation is linear if it can be expressed in the 
form 


It is homogeneous if h(x) = 0 for all x, inhomogeneous otherwise. 
For all the cases that you will meet in this module, the initial-value 
problem 


“ + g(x)y=h(x), y(x0) = yo, 


has a unique solution. 


Unit 3 summary 


Linear first-order differential equations can be solved by the 
integrating factor method. An integrating factor for the 
equation dy/dx + g(x) y = h(x) is a function p(x) with the property 
that, after multiplication by p(x), the left-hand side of the equation 
becomes an exact derivative: 


dy d 
ote) (+ o(e)u) = £(0(@)s). 
Multiplication by an integrating factor therefore makes it possible to 


solve the equation by direct integration. 


The integrating factor for the equation dy/dx + g(x) y = h(x) is 


p(e) = exp ( [ g(e) dr). 


In the constant-coefficient case dy/dx + Ay = h(x), where A is a 
constant, p(x) = exp(Az). 


The steps of the integrating factor method are as follows. 


(a) Determine the integrating factor p(x). 


(b) Rewrite the equation as < (p(x) y) = p(x) A(z). 


(c) Integrate to obtain p(x) y = [oo h(x) dz + C, where C is a 
constant. 


(d) Divide through by p(x), to obtain the general solution in explicit 
form. 


Unit 3 Second-order differential equations 


1. 


A second-order differential equation is a differential equation that 
contains the second derivative d?y/dx? of the dependent variable y 
with respect to the independent variable x, but no higher derivative. 
Thus in addition to d?y/dx?, any or all of the following may occur in 
the equation: x, y, dy/dx. The general solution of such an equation 
normally involves two arbitrary constants. 
A linear second-order differential equation is one that can be written 
in the form 
d*y 
a(x) <4 + 6(0) H + e(2)y = F(2), 


where a(x), b(x), c(z) and f(x) are given functions. If f(z) is 
identically zero, then the equation is homogeneous; otherwise it is 
inhomogeneous. 
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A linear constant-coefficient second-order differential equation is 
one in which the functions a(x), b() and c(x) are constant (with 


a #0), ie. 
dy dy 
ava thy + cy = f(z), 
and f(x) is a given function. 


The methods of solution described in the unit make use, explicitly or 
implicitly, of the following principle of superposition. 


If yi(x) is a solution of ay” + by’ + cy = fi(x), and yo(x) is a solution 
of ay” + by’ + cy = f(x), then for any constants ky and ko, 


y(@) = ky yi (x) + ke yo(z) 
is a solution of the equation 
d2 


r 
ai tb + cy = ki file) + ke fala): 
£ dx 


The general solution of the homogeneous equation 


is obtained by following the procedure below. 
(a) Solve for \ the auxiliary equation 
a? +bA+e=0. 


(b) (i) Ifthe auxiliary equation has distinct real roots Ay and Ao, 
then the general solution of the differential equation is 


y(z) = Ce + De®. 


(ii) If the auxiliary equation has equal real roots Ay = A2 
(ie. a\? + bA + cis a perfect square), then the general 
solution of the differential equation is 


y(x) =(C + Da)e™*. 


(iii) If the auxiliary equation has complex conjugate roots 
Ay =a+if and Ao = a —if, then the general solution of the 
differential equation is 


y(z) =e“ (C cos Bx + Dsin Ba). 
In each case, C and D are arbitrary constants. 


For the oscillation represented by «(t) = Asin(wt + @), the constant 
A> 0 is called the amplitude, the constant w is called the angular 
frequency, and the constant ¢ is called the phase constant. The 
period of the oscillation is 27/w. 


Unit 3 summary 


The general solution of the inhomogeneous equation 


d*y , dy 


is given by y = Yc + Yp, where: 
e Yc, the complementary function, is the general solution of the 
associated homogeneous equation 


¢ Yp, a particular integral, is any particular solution of the 
original inhomogeneous equation. 


This leads to the following procedure for finding the general solution 
of an inhomogeneous equation. 


(a) Find the complementary function, by solving the auxiliary 
equation of the associated homogeneous equation. 


(b) Find a particular integral, as described below. 
(c) Add the particular integral to the complementary function. 


For certain inhomogeneous equations, particular solutions can be 
found by the method of undetermined coefficients. This method 
works when the function f(x) is a polynomial, an exponential function 
or a sinusoidal function, i.e. a linear combination of a sine and a cosine. 


The method involves using a trial solution y(x) of a form similar to 
that of f(z), but with coefficients that initially are undetermined. The 
coefficients are determined by substituting the trial solution into the 
differential equation and choosing the coefficients so that the equation 
is satisfied. 


Suitable trial solutions y(x) for specified right-hand-side (or target) 
functions f(x) are shown in the following table. 


Target function f(x) Trial solution y(x) 
MnL”™ + My—12" 1 +++: Pat” + pnt” 1 +++: 
+ mr + Mo + pix + po 
mek per 
mcoska + nsin kx pcoskx + qsinkx 
The coefficients in the left-hand column, m,k,mo,m1,..., are given 
constants. Those in the right-hand column, p, po, pi,..., are constants 


to be determined, by differentiating the expression for y(x) twice, 
substituting into the left-hand side of the differential equation, and 
equating coefficients of corresponding terms on the left- and 
right-hand sides. 


Note that even if some of the given coefficients in f(x) are zero — for 
example f(x) = 3x or f(x) = 2cosz — it is still necessary in general to 
use the full expression from the right-hand column of the table. 
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10. 


An initial-value problem for a second-order differential equation is 
a problem in which one has to find the particular solution y = y(zx) of 
a given equation such that y and its derivative y’ take specified values 
yo and zg, respectively, when the independent variable x takes the 
value zo. The numbers zo, yo and zp are called initial values. The 
relationships between initial values are called initial conditions. 
These may be specified either as ‘y = yo and y'! = z9 when x = zo’, or 
as ‘y(x0) = yo, y'(Zo) = 20’- 


An initial-value problem has a unique solution, which can be obtained 
by finding the general solution of the differential equation and 
substituting in the initial values to determine the two arbitrary 
constants that it contains. It is necessary first to differentiate the 
general solution to apply the initial condition y/(z9) = 20. 


In a boundary-value problem, a condition is placed on the value of 
either y or its derivative, or some combination of the two, at each of 
two different values of x. The conditions are referred to as boundary 
conditions, and values of x and y in these conditions are boundary 
values. Such a problem may have a unique solution, or no solution, or 
an infinite number of solutions. 


Unit 4 Vectors and matrices 


A vector is a mathematical object consisting of a non-negative real 
number called its magnitude, and a direction. 


In printed text, vectors appear in bold; in handwritten work they are 
written underlined. 


e Two vectors are equal if and only if they have the same 
magnitude and direction. 


e The magnitude of a vector v is denoted by |v| or sometimes by v. 

e The zero vector 0 has magnitude zero; no direction is defined for 
it. 

e The displacement from a point P to a point Q is represented by 
the displacement vector PQ. 


In discussions involving vectors, the word scalar is used to denote a 
real number (positive, negative or zero). 


Scaling a vector or scalar multiplication of a vector is the process 
of multiplying a vector by a scalar. 


For a vector v and (non-zero) scalar m, the scalar multiple mv is 
the vector whose magnitude is |m||v|, and whose direction is 


e in the same direction as v if m > 0 


e in the opposite direction to v if m < 0. 


Note that —v = (—1)v is the vector with the same magnitude as v 
but pointing in the opposite direction (such vectors are called 
antiparallel). 


Also note that Ov = m0 = 0 for any vector v and scalar m. 

A unit vector is a vector whose magnitude is 1. For v 4 0, the 
vector V = (1/|v|)v is a unit vector in the same direction as v. Unit 
vectors along the Cartesian axes are called Cartesian unit vectors, 
and are denoted by i, j and (in three dimensions) k. 

Vector addition is defined geometrically by the triangle rule (see 
the diagram in the margin). 


The vector a+ b is called the sum or resultant of a and b. Vector 
subtraction is defined by a— b = a+ (—b). 


Note that a—a=0 and 0+a =a for any vector a. 


A three-dimensional Cartesian coordinate system consists of three 
mutually perpendicular axes, usually labelled x, y and z, that meet at 
a point called the origin. 


A right-handed coordinate system is one satisfying the 
right-hand rule: 


e Point the straightened fingers of your right hand in the direction 
of the positive x-axis, and rotate your wrist until you find that 
you can bend your fingers in the direction of the positive y-axis. 

e Extend the thumb of your right hand. This is the direction of the 
positive z-axis for a right-handed coordinate system. 


Given a Cartesian coordinate system, any vector a can be written 
uniquely in component form as 


a=azi+a,j+azk, or equivalently a= (az,ay,az), 


where i, j and k are unit vectors in the directions of the positive z-, y- 
and z-axes, respectively. The numbers az, ay and a, are called the 
(Cartesian) components of a in the directions of i, j and k. The 
process of finding the Cartesian components of a vector is called 
resolving a vector into its components. 


Let a= azit+ ayj + azk and b = bzi+ byj + bk. Then: 
e jal=,/ai+az+a? 
e a+b= (az,+bz)i+ (a, + by)j + (a, + bz)k 


e ma= (maz)i+ (may)j + (maz)k. 


Unit 4 summary 


a+b 


a 


Triangle rule 


Here and below, we give the 
formulas for the 
three-dimensional case; the 
corresponding two-dimensional 
versions are obtained by 
omitting the third component. 
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10. 


The position vector of a point A is the displacement vector OA, 
where O is the origin. So if A has coordinates (x,y, z), then its 


—> “ 
position vector is OA = ri+ yj + zk. 


If P and Q are two points with coordinates (xp, yp, zp) and 
(zQ, YQ, 2Q), respectively, then the displacement vector PO i is 


PO = 00 — OP = (xq = xp)i i+ (va = yP)j + (za = zp)k. 
The vector equation of the straight line joining points P and Q 
whose position vectors are p and q, respectively, is 

r(t) =p+t(q—p) =(1—#)p+ia. 


If t is allowed to vary over all real numbers, then r(t) traces out the 
entire straight line through the points P and Q. If 0 <t < 1, then r(t) 
traces out the line segment from P to Q. 


The scalar product (or dot product) of two vectors a and b is the 
scalar defined by 


= |a| |b] cos 0, 


where @ (for 0 < @ < 7) is the angle between the directions of a and b. 
In particular, non-zero vectors a and b are perpendicular if and only if 
a:b= 0. Also, a-a=|a/’. 


e The scalar products of the Cartesian unit vectors are 


i<G=jjsk«k=i, i-fSj-k=k-i=6. 


e In component form, if a= azi+ ayj + azk and 
b = bzi + byj + bzk, then 


a b= a,b, + 0,0, + a,b, 
The formula |a|? =a+-a=a?2+ a; + a? is a particular case. 
e The angle 6 between two (non-zero) vectors a and b is given by 
Abe + Ayby t+azb, 


it~ [ag + ay + ay b2 +02 +02 


The component of a vector a in the direction of a unit vector 0 is 


cos @ = 


dy = a* i = |al cos, 
where @ is the angle between a and U. 
The Cartesian components of a are a- i, a-j and a-k; in other words, 


a=(a-i)i+(a-j)j+(a-k)k. 
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Unit 4 summary 


The vector product (or cross product) of two vectors a and b is 
the vector defined by 

a X b = ({a| |b] sin @) n, 
where 6 (for 0 < 6 < 7) is the angle between the directions of a and b, 
and n is a unit vector at right angles to both a and b, whose sense is 
given by the right-hand rule for vector products: this is like the 
right-hand rule for a coordinate system given above, but with the z-, 
y-, Z-axes replaced by a, b, fi, respectively. 
The vector product has the following algebraic properties: 
e axXb=-—(bxa). 
e axX(b+c)=(axb)+(ax c) and 

(a+b) xc=(axc)+(bxc). 
e = For any scalar \, (Aa) X b = A(a X b) =a X (Ab). 
e axXb=O0 if and only if one of a and b is a scalar multiple of the 


other — in other words, one of a and b is zero, or they are parallel 
or antiparallel. (In particular, a x a= 0.) 


e In general, a x (b xX c) #(aXb) Xc. 


e The vector products of the Cartesian unit vectors are 


ixi=jxj=kxk=0O, 
ixj=k, jxk=i, kxi=j. 


e Incomponent form, the vector product of the vectors 
a= azi+ a,j +azk and b = bi + b,j + b-k is 


a X b= (aybz — azby) i+ (azbe — azbz)j + (Arby — ayby) k. There is an easy way to 
. remember this formula in terms 
The area of a parallelogram whose sides are defined by vectors a of determinants: S68 items 29 
and b is ja x bj. and 27. 


The area of a triangle two of whose sides are a and b is $|a x bl. 


The volume of a parallelepiped whose sides through one vertex are 
a, b and¢, is |(a x b) +e]. 

A matrix is a rectangular array of elements (usually numbers) 
arranged in rows and columns. If a matrix has m rows and n columns, 
then it is said to be an m x n matrix (read as ‘m by n matrix’), or to 
be of order m x n. 


e Anm xn matrix A that has aj; as its element in the ith row and 
the jth column is written as 


ai a12 Qin 

a2) a22 a2n 
A= ; : 

AGm1 Am2 *** Amn 


or A = [ajj] for short. 


35 


Handbook 


The second form here is simply 
an alternative to the first, used 
to save space. (The superscript 
T stands for ‘transpose’, which 
is discussed in item 19.) 
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14. 


15. 


16. 


e A matrix with one column is often referred to as a column 
vector; a matrix with one row is a row vector. For example, a 
vector such as a = a,i + a2j + a3k may be written as the column 


vector 
at 
_ - T 
a= ja or a=[a1 ag as]. 
a3 


e A square matrix is one that has the same number of rows as 
columns. 


e Them xn zero matrix O is the m xX n matrix all of whose 
elements are zero. 


e Two matrices A = [a;;] and B = [b;;] are equal if they both have 
order m x n, and a;; = bj; for alli =1,2,...,m and 
j=1,2,...,n. 


If A = [a,j] and B = [};;] have the same order, then their sum is a 
matrix of the same order, given by A + B = [a;; + bij]. Thus the 
elements of A+ B are obtained by adding together the elements of A 
and B in the same positions. For example, for column vectors we have 


ay by dy + by 
aj} + |bo} = Jag + be 
a3 b3 a3 + b3 


Similarly for subtraction. 
The scalar multiple of a matrix A = [a;;] by a number 4 is given by 
AA = [Aaij]. 


Thus AA is obtained by multiplying each of the elements of A by A. 
For example, for column vectors we have 


ay Aa, 
r ag) = Aag 
a3 Aa3 


Matrix addition and scalar multiplication obey obvious rules of 
algebra. For any matrices A, B and C of the same order, and any 
scalar A, the algebra is: 

e commutative, A+B=B+A 

e associative, (A +B)+C=A+(B+C) 

e distributive, \(A + B) = AA + AB. 
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18. 


19. 


20. 


Unit 4 summary 


The product of an m x p matrix A = [a,j] and a p x n matrix 
B = [bj;] is the m x n matrix C = AB, where C = [c;;] is formed 
using the ith row of A and the jth column of B, to give 


p 
Gj = s, inde = ai1b1; + aigba; + +++ + Gipdp;. 
k=1 
We imagine placing the ith row of A on the jth column of B, 
multiplying the pairs of elements together, and then adding the 
products. 


The product of two matrices can be formed only if the number of 
columns of the first matrix is the same as the number of rows of the 


second. For example, the product of the matrix A = ° and the 


column vector r=[a yj" is given by 


a b| |x ax + by 
Ar = = : 
° i "| ie + | 
In general, matrix multiplication is not commutative, so AB may not 


be equal to BA. 


For any matrices A, B and C, of the appropriate sizes so that all 
products can be formed, matrix multiplication is: 


e associative, (AB)C = A(BC) 

e distributive over addition, A(B + C) = AB+ AC. 

The powers of a square matrix are defined in the obvious way: 
A? = AA, A? = AAA, and so on. 


The transpose A” of a matrix A is the matrix obtained by 
interchanging the rows and columns of A. If we denote the matrix A 
by [aij] and A” by [aj]; then al; = aj. If A is an m x n matrix, then 
A’ is an n x m matrix. 


e For any matrices A and B of the same order, (A7)’ = A and 
(A+B)? = AT+BI. 
e For any two matrices that can be multiplied, 
(AB)’ =B’A‘; 
note the change in order. 


The identity matrix is a square matrix I whose elements are all zero 
except the diagonal elements, which equal 1. For example, the 2 x 2 
0 


identity matrix is I = 0 41° 


Assuming that all the matrix products make sense, the identity matrix 
satisfies AI = A and IA=A. 
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Although we are free to use any 
row or column when applying 
Laplace’s rule, it is usual to use 
the first row. 
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22. 


23. 


24. 


The inverse of a square matrix A is the matrix, usually denoted A7!, 
of the same order such that AA7! = A“'A =I. 


Only a square matrix can have an inverse, but many square matrices 
do not have inverses; those that do not are called non-invertible (or 
singular), while those that do are called invertible (or 
non-singular). 


e T'=L. 


e If A and B are invertible matrices of the same size, then AB is 
invertible and 


(AB)“*=B A. 
note the change in order. 


e A2~x 2 matrix 
a b 
aml a 
is invertible if and only if ad — be # 0. When this condition holds, 
we can evaluate the inverse using the formula 


= 1 d =) 
AS 
ad — be & 


If A= ° 4 , then its determinant is a scalar, denoted by det A or 


‘ F given by det A = ad — bc. Thus A is invertible (non-singular) if 


det A #0, and non-invertible (singular) if det A = 0. 


The determinant of the 3 x 3 matrix 


ay ag a3 
A= |b, bo bs 
Cj CoQ C3 
is 
bo b by b by 6b 
det A = a1 2 3] 1 3) Le 1 2 
C2 3 Cy C3 Cy © 


= a1 (bec3 — bgc2) — a2(bicg — b3¢1) + a3(bice — bec1). (3) 


The determinant of an n x n matrix for n > 2 can be evaluated using 
Laplace’s rule. 


Given an n x n matrix A = [a;;], its determinant det A may be 
expanded in terms of the elements in row 7 and their cofactors Cj; as 


det A = aj1Cy + aaCig +--+ + GinCin, 


where Cy = (1) Mi;, and M;; is the minor obtained by deleting 
row 7 and column j of the original determinant and forming the 
determinant of what remains. 


This rule can be used to derive equation (3). 


25. 


26. 
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28. 


Unit 4 summary 


All determinants have the following properties. 


e Interchanging any two rows or any two columns of A changes the 
sign of det A. 


e = det(A”) = det A. 


e Multiplying any row or any column of A by a scalar k multiplies 
det A by k. 


e = For any scalar k and any n x n matrix A, det(kA) = k” det A. 


e Adding a multiple of one row of A to another row does not 
change det A. Likewise for columns. 


e Ifa matrix A is invertible, then det(A) 4 0 and 
det(A7') = 1/det(A). 
e For any two matrices that can be multiplied, 
det(AB) = det(A) det(B). 
To find the inverse of an n x n matrix A = [a;;], do the following. 
e Evaluate det A and confirm that det A 4 0. (If det A = 0, then 


no inverse exists.) 


e =Evaluate the cofactor Cj; of each element a;;, using the relation 
Ci; = (—1)"*7.Mj;, where Mj; is the minor obtained by deleting 
row 7 and column j of the original determinant and forming the 
determinant of what remains. 


e Form the square matrix C = [C;;] composed of the cofactors Cj;. 
e Take the transpose of C to obtain the matrix C7. 
e Scale the matrix C7 by 1/det A to obtain the inverse of A: 
a__! or 
det A 
The vector product b x c of vectors b= [b, by bz]? and 


c=|% & cz]" can be expressed as a 3 x 3 determinant: 
i jk 
bxc=|b, by eI. 
Cy By & 


Lea=[ wt Gl.bHeh & tf ede=i 6 «@. 
Then the scalar triple product a- (b X c) is given by 


e The volume of the parallelepiped with sides defined by the vectors 
a, b and c is given by the modulus of the scalar triple product, 
and therefore by the modulus of the determinant above. 


e Because interchanging rows of a matrix just changes the sign of its 
determinant, the scalar triple product obeys the cyclic identity 


a-(b xc) =b-(c X a)=c- (aX b). 


39 


Handbook 


40 


29. Any linear transformation of the plane can be represented by a 
2x 2 matrix A, called the transformation matrix. Its effect on the 
position vector r is to transform it into the matrix product Ar. So if 


a-[' 


a b 


d andr=[z yj’, then 


ja b] ja] — jaw + by 
Ar=|° Mea eeee 


So the matrix A transforms the position vector r= [x y]” to the 
position vector [az + by cx dy)’. 


The Cartesian unit vectorsi=[1 0]? andj=[0 1]? are 
transformed into the columns of the transformation matrix A. 


The unit square whose sides are i and j is transformed into a 
parallelogram with area |det A|, the modulus of det A. 


A singular (non-invertible) matrix transforms the unit square into 
a line (or a point if A = 0). 


Performing two successive transformations, represented by first A 
then B, is equivalent to the effect of a single transformation, 
represented by the matrix product BA (note the order). 


The inverse A~! of a transformation matrix A reverses the effect 
of the transformation. 


The dilation matrix 
K O 
D(K, A) = fi 1 
is a linear transformation that rescales the plane by x in the 
z-direction and in the y-direction. 


The inverse of a dilation matrix is another dilation matrix: 


D~!(«, A) = D(1/k,1/A). This exists only if « 40 and \ 40. 


The rotation matrix 


Ria} = ie a —sin | 


sin a COS 


is a linear transformation that rotates the plane by an angle @ 
about the origin, in the anticlockwise direction. 


The inverse of the rotation matrix R(a) is R7!(a) = R(—a). 


Unit 5 summary 


Unit 5 Linear algebra 


1. 


An equation involving n variables x71, 72,...,2%n is said to be linear in 
each of those variables if it can be written in the form 


a1 21 + AQ%2 + A323 ++++ + AnIn = 6, 
where @j,@2,...,@n, and b are constants. 


Consider a system of linear equations 


A111 + Ay2%2 +--+ + AinTn = b4, 
2121 + A22%Q +--+ + GonTn = be, 


Gn1X1 + An2%2 +++: + Anntn = bn, 


where there is the same number n of equations as of unknowns 
%1,%2,...,2%p7. Such a system can be written in matrix form as 


Ax =b. 


Here A is the n x n square matrix A = [a,j], called the coefficient 
matrix, formed from the coefficients a;;. The vectors x and b are the 
column vectors x= [a1 zo ... dn)’ andb=(bi bo ... dnl’. 


Usually the elements of A and b are given, while those of x are 
unknown and to be solved for. 


The augmented matrix of the system is the n x (n+ 1) matrix that 
has the coefficient matrix for its first n columns and whose final 
column is the column vector b consisting of the right-hand sides of the 
equations. The augmented matrix is usually written Alb, with a 
vertical bar separating the coefficient matrix A from the 
right-hand-side vector b. 


The leading or main diagonal of a matrix A = [a;;] is the collection 
of elements a;;. For a square matrix it is the diagonal that runs from 
the top left corner to the bottom right corner. 


An upper triangular matrix is a square matrix in which each entry 
below the leading diagonal is 0. A lower triangular matrix is a 
square matrix in which each entry above the leading diagonal is 0. 

A matrix that is upper triangular, lower triangular or both 

(i.e. diagonal) is sometimes referred to simply as a triangular 
matrix. 


In principle, any system of equations Ax = b, for A non-singular, can 
be solved by calculating the inverse of A using the methods of Unit 4, 
then setting x — A~'b. However, this method is increasingly 
inefficient for systems of three equations or more. A more efficient 
method is to use Gaussian elimination. 
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Performing a row operation 
really involves three sub-steps: 


e 
e 
e 


42 


write down the plan 
implement the plan 
relabel the rows. 


10. 


Gaussian elimination is an efficient and systematic method of 
obtaining the solution x of a system of linear equations Ax = b. 


To solve a system of n linear equations in n unknowns, with coefficient 
matrix A and right-hand-side vector b, by Gaussian elimination, carry 
out the following steps (if possible). 


(a) Formulation: Write down the augmented matrix Alb, denoting 
its rows by R1,..., Rn. 


(b) Elimination: Adapt the following row operations as necessary. 


(i) Subtract a multiple of Ry from Re, to reduce to zero the first 
element in the first column below the leading diagonal. 


(ii) Similarly, subtract a multiple of Ri from R3,...,Rn to 
reduce to zero all the other elements in the first column below 
the leading diagonal. 


(iii) In the new matrix obtained, subtract multiples of Ra from 
R3,..., Ry to reduce to zero all the elements in the second 
column below the leading diagonal. 


(iv) Continue this process until A|b is reduced to U|c, where U is 
an upper triangular matrix. 


(c) Solution: Solve the system of equations with coefficient matrix U 
and right-hand-side vector c by back substitution — that is, 
solve the nth row for x»; use this solution to solve the (n — 1)th 
row for 2,—1; continue in this way, working up the rows, at each 
stage substituting values already known. 


Sometimes during the Gaussian elimination procedure, an unexpected 
zero can turn up on the leading diagonal element of a row. This can 
make it impossible to use the row for the elimination procedure. To 
remedy this, interchange the row with one below, then continue as 
normal. 


The system of linear equations represented by the matrix equation 
Ax = bis said to be singular when the coefficient matrix A is 
singular (i.e. when det A = 0). Such a system does not possess a 
unique solution but, depending on details, may have no solution or an 
infinity of solutions. When the system has no solution, it is sometimes 
said to be inconsistent. 


If Gaussian elimination produces an upper triangular coefficient 
matrix U in which the final element on the leading diagonal is zero, 
then the back substitution process breaks down and the system is 
singular. In this case, if the final element in the final column is 
non-zero, then the system has no solutions, and if it is zero, then there 
may be an infinite number of solutions. 


The equations have a unique solution if and only if none of the 
elements on the leading diagonal of U is zero. 

The n vectors v1, V2,...,Vn are linearly dependent if there are 
numbers a1, Q2,...,Q@m, not all zero, such that 


ayVvy + Agvg + +++ + AnVn = 0. 


11. 
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13. 


Unit 5 summary 


In this case we can express one of the vectors as a linear combination 
of the others. 


The vectors are linearly independent if the only solution of the 
equation 


Q1V1 + Q9V2 +--+ +AnVn = 0 


isa, =a2=a3=...=Q,=0. 


Two (non-zero) vectors are linearly independent if they are not 
collinear, i.e. parallel or antiparallel. Three (non-zero) vectors are 
linearly independent if they are not coplanar, i.e. if they do not all lie 
in the same plane. 


The space to which vectors belong is often called a vector space. 
The dimension of a vector space is equal to the maximum number of 
linearly independent vectors that it allows. 


In an n-dimensional vector space, if you have n linearly independent 
vectors Vj,V9,...,V», then you can express any other vector as a 
linear combination of them: 


V=CcCiVvy + Cove +++: + CnVn- 


The set of linearly independent vectors v1, V2,...,Vn is called a basis 
for the n-dimensional vector space. 


An eigenvector of a square matrix A is a non-zero vector v such that 
Av = Xv for some scalar 4. The number 4 is the eigenvalue 
corresponding to the eigenvector v. Any non-zero scalar multiple of an 
eigenvector is also an eigenvector with the same eigenvalue. 


e Eigenvalues are distinct if they have different values; otherwise, 
they are said to be repeated. Eigenvectors are said to be distinct 
if they are linearly independent. 


e Ifa matrix has n distinct eigenvalues, then the corresponding 
nm eigenvectors are all linearly independent. 


e If some of the eigenvalues of a matrix are the same, then the 
corresponding eigenvectors may or may not be linearly 
independent; further investigation is required. 


If an n X n matrix has n linearly independent eigenvectors 
V1, V2,-..,Vn, then these can be used as a basis for n-dimensional 
vectors. 


So any n-dimensional vector v can be written as 
V=cyVvy + CoVQ + +++ + CnVn 


for some scalars ¢,¢2,...,C,. This is called the eigenvector 
expansion of v. 


The eigenvector expansion can be used to show that for (almost) any 
vector v and for large k, A“v is proportional to the eigenvector of A 
that has the eigenvalue with the largest modulus. 
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14. 


15. 


16. 


ile 


Ann xn matrix A has characteristic equation 
det(A — AT) = 0, 


where I is the n x n identity matrix. The left-hand side of this 
equation is a polynomial of degree n in X. Its roots are the eigenvalues 
of A. 


In principle, the eigenvalues and eigenvectors of a square matrix A can 
be found as follows. Find the roots of the characteristic equation of A 
to obtain the eigenvalues. For each eigenvalue \, solve the 
eigenvector equations (A — \I)v = 0 to obtain the corresponding 
eigenvector. 


If the eigenvalue is not repeated, then this will determine the 
eigenvector up to a scalar multiple. It is usually sufficient to choose 
some value for the scalar multiple to determine a suitable 
representative eigenvector. 


To find the eigenvalues of the 2 x 2 matrix A = ° : 


| , do the 
following. 
(a) Write down the characteristic equation det(A — AI) = 0. 
(b) Expand this as 
a—A b 

c d—xX 


| =  —(a+d)\+ (ad — be) = 0. 
(c) Solve this quadratic equation to find the two values of A, which 
are the required eigenvalues. 


For the above 2 x 2 matrix A, the eigenvalues are 


A=} (att V@—d? +4). 


To find the eigenvector corresponding to the eigenvalue A, do the 
following. 


(a) Write down the eigenvector equations 
(a—A)xa+ by = 0, 
cz + (d— A)y = 0. 
(b) This pair of equations typically reduces to a single equation that 
is readily solved for « and y. The eigenvector is given by 
v=[x y]", with x and y replaced by their solved values. Any 
non-zero scalar multiple is also an eigenvector. 


The trace of a square matrix is the sum of the elements on its leading 
diagonal. The trace of A is denoted by tr A. 


The characteristic equation of a 2 x 2 matrix can be written as 


7 -—trAA+det A =0. 


18. 


19. 


If the elements of the above 2 x 2 matrix A are real, then the value of 
the discriminant D = (a — d)? + 4be characterises its eigenvalues: 


e If D>0, then there are two distinct real eigenvalues. 


e If D<0, then the two eigenvalues are complex conjugates of each 
other. 


e If D=O, then the two eigenvalues are identical. 


There are a number of general rules that apply to the eigenvalues and 
eigenvectors of an n x n matrix A. To understand these rules, we need 
some definitions about matrices: 


e A real matrix is one whose elements are all real. 

e Asymmetric matrix is one that is equal to its own transpose: 
A= A?, 

Eigenvalue rules 

e The product of the eigenvalues of A is det A. 

e The sum of the eigenvalues of A is trA. 


e Any complex eigenvalues of a real matrix occur in complex 
conjugate pairs, i.e. if A is a complex eigenvalue, then so is X. 


e The eigenvalues of a triangular matrix are the diagonal entries. 
e The eigenvalues of a real symmetric matrix are real. 


e A matrix is non-invertible if and only if at least one of its 
eigenvalues is 0. 


Eigenvector rules 


e Ifa matrix has n distinct eigenvalues, then the corresponding 
n eigenvectors are all linearly independent. 


e Fora real matrix, the components of an eigenvector corresponding 
to a real eigenvalue can be chosen to be real. 


e For a real matrix, if A is a complex eigenvalue with corresponding 
eigenvector v, then Vv, the vector whose components are the 
complex conjugates of those of v, is an eigenvector corresponding 
to the eigenvalue 1. 


e For any real symmetric matrix A, the eigenvalues are real, and 
the eigenvectors v; may be chosen to be real and orthogonal to 
each other, i.e. v; > vj = 0 for 2 Fj. 


Unit 5 summary 


Using matrices, the inner 
product v; + vj; is calculated 
using v) v;. 
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Note that there are exceptions to 
this rule, but we do not consider 
such cases in this module. 
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Unit 6 Systems of linear differential equations 


1. 


Ds 


A system of linear constant-coefficient first-order differential 
equations for unknowns 71, %2,...,2, takes the form 


Hy = A410, + AyQ%Q + +++ + Ainin + hi(t), 
£2q = aq 41 + ag2%2 +--+ + Gontn + ha(t), 


Dn = Apri, Gnade i + Opn Ba + Pinal t) 
where there is the same number n of equations as of unknowns. 


The independent variable t often represents time. The dependent 
variables are x(t), Z2(t),...,2n(t); but for systems with n = 2 or 
n = 3, 2, y and z are often used instead of 71, rg and x3. The 
coefficients aj; are constants. 


The equations may be expressed in matrix form as 


x=Ax-+h, 
where 
eH |e_ Be xa Bel and 2 ty He ons byl: 


here A = [{a;;] is a constant matrix called the matrix of coefficients, 
but the vector h=[h} ho ...  hny|” may depend on t. 


The general solution of a first-order system of n equations can be 
expressed as a solution containing n arbitrary constants. 


A particular solution is a solution containing no arbitrary constants 
and satisfying given conditions. 


Particular solutions of systems of n first-order differential equations 
are usually obtained by demanding that the general solution satisfies 
n initial conditions. These conditions fix the n arbitrary constants. 


A system with h = 0, i.e. of the form x = Ax, is said to be 
homogeneous. An inhomogeneous system is one where h + 0. 


To find the general solution of x = Ax, where A is an n X n matrix, 
proceed as follows. 


If the matrix A has only real eigenvalues: 


e Find the eigenvalues 1, A2,..., An and corresponding eigenvectors 
V1, V2,---,Vn of the matrix A. 


e Write down the general solution in the form 
e=Cpne + Coyote + Chvne”, 


where C1, Co,...,Cp are arbitrary constants. 


Unit 6 summary 


If the matrix A has some complex eigenvalues (which must occur in 
complex conjugate pairs \ and 4, with corresponding complex 
conjugate eigenvectors v and V), then a further step is necessary: 


e Replace the complex terms ve and ver! appearing in the general 
solution with Re(ve*’) and Im(ve™). 


The general solution will then be real-valued for real 
C1, Co,...,Ch. 


The above procedure fails if A does not have n linearly independent 
eigenvectors. This case is not covered in this module. 


The complementary function of an inhomogencous system 

x = Ax +h is the general solution of the corresponding 
homogeneous system x = Ax. A particular integral of the 
inhomogeneous system is any solution of it containing no arbitrary 
constants. 


The general solution of the inhomogeneous system x = Ax + h takes 
the form x, + Xp, where x, is the complementary function and xp is a 
particular integral. 


To find a particular integral x, = [zp yp|’ of x = Ax +h, where A 
is a 2 x 2 matrix, try a solution constructed as follows. A similar procedure works for 
systems of more than two 


e When the elements of h are polynomials of degree k, choose x, eqaRéione: 


and yp to be polynomials of degree k. 


e When the elements of h are multiples of the same exponential 
function, choose x» and y, to be multiples of this exponential 
function. 


The trial solution will have a number of undetermined coefficients. 
To find their values, substitute the trial solution into the system of 
differential equations and equate coefficients of corresponding terms. 
This will give a number of equations that can be solved for the 
coefficients. 


A system of homogeneous linear constant-coefficient 
second-order differential equations for unknowns 21, %9,...,%n 
takes the form 


Ly = 44121 + 19%Q + +++ + GinEn, 


LQ = Ag, L1 + AQ9%Q + +++ + Anln, 


Ln = Ani XL] + An2QLa + +++ + AnnXn, 
where there is the same number n of equations as of unknowns. 
The equations may be expressed in matrix form as 

X= Ax, 


where A is a constant matrix. 
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Complex eigenvalues and 
repeated real eigenvalues are not 
discussed, but they can be dealt 
with by generalising here. 


In this module, unless told 
otherwise, you may assume that 
functions are ‘sufficiently 
smooth’ for their derivatives to 
be defined and well-behaved. 
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To solve a system X = Ax, where A is an n x n matrix with n distinct 
real eigenvalues, do the following. 


e Find the eigenvalues 1, A2,--.,An of A, and a corresponding set 
of eigenvectors v1, V2,.-.,Vn- 


e Each positive eigenvalue A, corresponding to an eigenvector v, 
gives rise to two linearly independent solutions 


Jxt Vat 


ve and ve 


Each negative eigenvalue , corresponding to an eigenvector v, 
gives rise to two linearly independent solutions 


vceosV—At and vsinV—At. 


A zero eigenvalue corresponding to an eigenvector v gives rise to 
two linearly independent solutions 


v and vt. 


e The general solution is then an arbitrary linear combination of the 
2n linearly independent solutions found in the previous step, 
involving 2n arbitrary real constants. 


Systems of second-order differential equations arise in the study of 
oscillating systems. A normal mode of oscillation is one in which 
all of the coordinates of the system oscillate sinusoidally with the same 
angular frequency. 


Unit 7 Functions of several variables 


A function f(2,y) defines a surface z = f(x,y), which can be 
visualised in a perspective view. If one of the variables is given a fixed 
value (y = a, say), then the corresponding section function f(x, a) 
can be plotted as a normal graph. A contour map for f(x,y) is a 
collection of contour lines in the xy-plane. Along each contour line, 
f(x,y) has a constant value. 


For a function f(x,y, z) of three variables, the contour lines are 
replaced by contour surfaces. 


The partial derivative Of /Ox of a function f(x,y) is obtained by 
differentiating f(x,y) with respect to x, treating y as a constant. It is 
also written as f,(z,y) or f,. All the usual rules of differentiation 
apply. More formally, 


of 


- — jim fe + 92,¥) = Fey) 
Ox Joly) = a 


6x 


Similar definitions apply to Of /Oy = fy(x,y), and to partial 
derivatives of functions of more than two variables. In a partial 
derivative with respect to a given independent variable, all the other 
independent variables are treated as constants. 


Unit 7 summary 


Functions obtained by partially differentiating a function once are 
called first-order partial derivatives. These can be partially 
differentiated again to obtain higher-order partial derivatives. For 
example, 


aed Of Of _ afofy.__ 
ee Foo and Ste = oe (SE) = So 


are second-order partial derivatives. 


For a sufficiently smooth function f(x,y), the mixed partial 
derivative theorem states that 
ay O° f ; 
Ox Oy = By Ox’ or equivalently Soy = Sya- 


A similar result applies to a function f(21,22,...) of n variables. 
Again, the order of differentiation does not matter, so 
ory 7 Ca 
Ox; On, 7 On; Ox; 


for all x; and x;. 


The chain rule of partial differentiation takes several different 
forms. 


e Chain rule for small changes If f = f(x,y), and the independent 
variables x and y change by small amounts 6z and dy, then the 
corresponding small change in f is 

a) 
Of = a oe + of oy. 
Oy 

e Chain rule “- differentiation with respect to a parameter If 
f =f(«,y), and the independent variables « = x(t) and y = y(t) 
are functions of a parameter t, then the rate of change of f with 
respect to t is 


df Of dx | Of dy 


dt Ox dt | Oy dt 


e Chain rule for a change of variables If f = f(x,y), and the 
independent variables x = x(u,v) and y = y(u,v) are functions of 
two other variables, u and v, then the partial derivative of f with 
respect to wu is 


Of _ Of Ox Of Oy 
du Ox Ou Oy Ou’ 


with a similar result for the partial derivative of f with respect 
to v. 


All these chain rules can be extended to functions of more than two 
variables. For example, if f = f(x,y, z), then the chain rule for small 
changes becomes 

Of Of Of 


ee i 
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Given a surface z = f(x,y), and a direction in the xy-plane defined by 
a unit vector n = cos @i+ sin @j, the slope of the surface in the 
direction of the unit vector is 


slope = cos Sf euag@t =e fo + Ny fy. 


Oy 
For a function f(x,y), the gradient vector is vector-valued function 
of x and y in the xy-plane given by 
OF. , Of s 
rad f= —i+—j. 
grad f an ay 
It has the following properties: 
e Fora unit vector n in the xry-plane, the slope of the surface 
z = f(x,y) in the direction of the unit vector is 
slope = n- grad f. 
e grad f points in the direction in the xy-plane that gives maximum 
slope, and its magnitude is equal to the maximum slope. 


e At any given point, grad f is perpendicular to the contour line 
through the point. 


The first-order Taylor polynomial for f(x,y) about (a,b) is 
pi (a, y) = f(a, b) + Folate b)(x = a) + fy(@, b)(y ~~ b). 


This matches the values and first-order partial derivatives of f(x, y) 
at (a, b). 
The plane given by 


z= pi(z,y) 


is called the tangent plane for f(x,y) at (a,b). For any direction in 
the xy-plane, the slopes of the function f(z, y) and the corresponding 
tangent plane are identical at (a, b). 


The second-order Taylor polynomial for f(x,y) about (a,b) is 


p2(x, y) = pilx,y) 
Si 5 (fux(a, b)(x — a)? a 2 fay(a, b) (x = Gite 6) + Syy(@, b)(y — b)”) . 

This matches the values and first- and second-order partial derivatives 
of f(x,y) at (a, 0). 
A point (a,b) inside the domain of f(x,y) is a stationary point if 
both fz(a,b) and fy(a,b) are equal to zero. There are three types of 
stationary point. 
e A local minimum occurs at (a,b) if there is a small region 

around (a,b) within which f(x,y) > f(a,6) at all points 

(x,y) # (4,0). 


11. 
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Unit 7 summary 


e A local maximum occurs at (a,b) if there is a small region 
around (a,b) within which f(x,y) < f(a, b) at all points 
(x,y) # (a, 6). 

e A saddle point is a stationary point that is neither a local 
minimum nor a local maximum. Through such a point, some 
paths climb to higher function values while others descend to 
lower function values. 


Stationary points are found by taking all the first-order partial 
derivatives of the given function, setting them equal to zero, and 
solving the resulting set of simultaneous equations. For example, the 
stationary points of f(a, y) occur at points (a, y) that satisfy the 
simultaneous equations 


fe(t,y)=0 and fy(x,y) =0. 


The Hessian matrix is the matrix of the second-order partial 
derivatives of f. 


To classify the stationary point at (a,b) of a smooth function f (2, y), 
do the following. 


(a) Find the second-order partial derivatives, and evaluate them at 
the stationary point. 


(b) Construct the Hessian matrix H at the stationary point, and find 
its eigenvalues. 


(c) Apply the following rules: The test is inconclusive if any of 


: 0.9 he eigenval r Br : 
e If all the eigenvalues are positive, then we have a local ee eee Oe eae 


minimum. 


e If all the eigenvalues are negative, then we have a local 
maximum. 


e If the eigenvalues have mixed signs, then we have a saddle 


point. 
Alternatively, for a function f(x,y) of two variables, the determinant 
test can be used. In this case the Hessian matrix can be written as The Hessian matrix of a smooth 
—_ fra (a, b) Fon a, ‘| _ | A al . function is always symmetric. 
Figs @s b) fyy (a, b) BC 


The stationary point is: 

e a local minimum if det H = AC — B? > 0 and A> 0 We could equally well use C 

e a local maximum if det H = AC — B? > 0 and A <0 oe la 
e asaddle point if det H = AC — B? < 0. 


The determinant test is inconclusive if det H = 0. 
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Unit 8 Multiple integrals 


1. 


The area integral of a function f(x,y) over a rectangular region S in 
the xy-plane, bounded by the lines x = a, x = b and y= c, y = d, can 
be found by two successive integrations. The inner integral is always 
performed first. For example, we can write 


[ f@waa= / “ ( ec i) dy, 


where the integral over x is carried out first, with y held constant. 
This gives a function of y, which is integrated over y. 


Alternatively, the same area integral can be written as 


[ie dA = i (a f(a,y) iy) dx. 


In this case, the integral over y is carried out first, treating x as a 
constant; the result is then integrated over x. 


The volume integral of a function f(x,y, z) over a cuboid region R, 
bounded by the planes x = a), © = a2, y = 04, y = b2, 2 = cy and 

Zz = Cg, can be found by three successive integrations. The innermost 
integral is evaluated first, and successive integrations move 
progressively outwards. For example, we can write 


[emsar= [0 [5 emt) 


where the integral over z is done first, with x and y held constant. 
The result is integrated over y, with x held constant, and the final 
integral is over x. 


Alternative orderings may be chosen. Inner integrals are always 
completed before integrals placed outside them. The limits of 
integration in a given definite integral can depend only on the variables 
of integration in integrals that lie further outside it (and are done 
after it). The limits of the outermost integral are always constants. 


In an area integral, if the function to be integrated takes the product 
form f(x,y) = g(x) h(y) and the limits of integration are all 
constants, then the area integral can be expressed as a product of 
definite integrals over x and y: 


[tena | "90 dat x / - h(y) dy. 


In a volume integral, if the function to be integrated takes the product 
form f(x,y, z) = u(x) u(y) w(z) and the limits of integration are all 
constants, then the volume integral can be expressed as a product of 
definite integrals over x, y and z: 


w=a2 y=b2 z=c2 
[fens dV = ula) de i u(y) dy x | w(z) dz. 


=a, =b1 SC]. 


However, such factorisations do not work in general. 


Unit 8 summary 


To find the area integral of a function f(x,y) over any region S in the 
xry-plane, start by choosing which integral to do first — that over x or 
that over y. The following steps assume that the integration over y is 
done first. yA 


(a) Draw a diagram showing the region of integration S. B(a)4 


(b) Draw a vertical strip parallel to the y-axis, centred on x, and 
spanning the region (as shown in the diagram). Determine the 
lower limit y = a(a) and the upper limit y = {(z) for this strip. 
These are the limits for the y-integration (the inner integration). 
In general, they are non-constant functions of x. 


o(z)}- Seas 

+ 

(c) Determine the minimum value x = a and the maximum value - _ b 
x = b for x-values throughout the region. These are the limits for 
the x-integration (the outer integration), and are always 
constants. 


Sketching a region of 
integration when the integral 
over y is done first 

(d) Write down the area integral as 


[fen dA = [- _ f(x,y) iy dx. 


(e) Evaluate the inner integral over y first, holding « constant, and 
substitute in the limits of integration. This gives a function 


y=h(x) 
g(x) = i f(x,y) dy. 
y 


=a(x 
(f) Evaluate the remaining definite integral of g(x) over x. 


If you choose to do the x-integral first, then the limits of integration 
must be found using a different sketch, with horizontal strips running 
parallel to the z-axis. The lower and upper limits of the z-integration 
are then functions x = u(y) and x = v(y), and the lower and upper 
limits of the y-integration are constants y = c and y = d. The integral 
is then written in the form 


Hagas= f(T saa |e 
I ee Vea 


The integral over x is done first, with y held constant; the result is 
then integrated over y. 


The volume integral of a function f(x,y, z) over any region R can be 
written as 


/ f(z,y,z)dV = | / / f(x,y, 2) dz | dy | dz. 
JR Jxr=a e y=a(a) . z=u(x,y) 


The limits of integration can be found by drawing two diagrams — a 
perspective view of the region R, and a projection of R onto the 
xy-plane. The limits of the z-integration are found from the 
perspective view, by considering a thin column centred on a typical 
point (x,y). The limits of the y-integration (which are functions of x) 
are found by considering a typical thin slice parallel to the y-axis, 
represented by a thin strip in the projection onto the ry-plane. 
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The constant limits for the outer z-integration are also found from 
this projection. This method can be adapted for different orders of 
integration. 


The area of a region in the ry-plane is equal to the area integral of the 
function f = 1 over the region. Similarly, the volume of a region in 
three-dimensional space is equal to the volume integral of the function 
f =1 over the region. 


Polar coordinates (r,¢) are related to Cartesian coordinates (x, y) 
by the equations 

zr=rcosdé, y=rsing. 
In polar coordinates an area element has area 

6A = ror dd. 


The area integral of a function f(r,@) over a disc S of radius R 
centred on the origin is 


o=27 r=R . 

[ears [Of se.oyrar) a 
Ss o=0 r=0 

where we have chosen to integrate first over r, and then over ¢. The 


reverse ordering is also valid. The limits of integration can be adjusted 
for regions occupying parts of a disc. 


Cylindrical coordinates (r,¢, z) are related to Cartesian 
coordinates (x,y,z) by the equations 


z=rcosd, y=rsing, z=z. 


In cylindrical coordinates a volume element has volume 


OV =r dr dodoz. 


# - / hs if 
23 


(b) 


(a) Cylindrical coordinates; (b) a volume element in cylindrical coordinates 


10. 


The volume integral of f(r, ,z) over a cylindrical region D, centred on 
the z-axis with radius R, and with flat surfaces at z = 0 and z =A, is 


z=h p=2r r=R 
[ fos. z)dV = - (se (f flr.a, 2)r ar) a) dz. 


Alternative orderings are also valid. The limits of integration can be 
adjusted for regions based on parts of a cylinder. 


A region that is axially symmetric around the z-axis has volume 
2=20 
Ven |” (real) — reine) de 
2=21 


where rmin(z) and rmax(z) are the minimum and maximum radii of the 
region for a given value of z, and z; and zg are the minimum and 
maximum values of z. For a solid object with no holes, ryin(z) = 0. 


Spherical coordinates (r,6,¢) are related to Cartesian coordinates 
(x,y,z) by the equations 


x=rsinOcos¢, y=rsindsingd, z=rcosé. 


In spherical coordinates a volume element has volume 


dV =r’ sin 0 dr 60 5d. 


(a) Spherical coordinates; (b) a volume element in spherical coordinates 


11. 


The volume integral of f(r,6,@) over a spherical region of radius R, 
centred on the origin, is given by 


o=20 6=n r=R 
= se 
I= a (f. s f(r,8,0)r sin dr) i) do. 


Alternative orderings are also valid. The limits of integration can be 
adjusted for regions based on parts of a sphere. 


A coordinate line is a line along which one coordinate varies while 
the other coordinates remain fixed. 


A coordinate system is said to be orthogonal if its coordinate lines, 
for different coordinates, meet at right angles. Cartesian, polar, 
cylindrical and spherical coordinate systems are all orthogonal. 


Unit 8 summary 
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12. For any coordinate u, the length of the segment of the u-coordinate 


13. 


line between u and u + du, where du > 0, is expressed as 
length of segment = h, du, 


where h,, is called the scale factor for the u-coordinate; this may be a 
function of the coordinates. 


If the coordinates (u,v, w) are related to Cartesian coordinates by 
equations of the form 


r=a(u,v,w), y=y(ur,w), y=2(u,v,w), 


then the scale factor h, is given by 


2 2 2 
hg = Ox + Oy + Oz ; 
Ou Ou Ou 
with similar formulas for h, and hy. 


Scale factors in some orthogonal coordinate systems are given in the 
table below. 


Cartesian (x, y) hie = 1, ty =1 

Cartesian (7,y,z) hg=1,hy=1,h,=1 
Polar (r, ¢) hg S 1, hig =" 

Cylindrical (7,0,2) p= 1, hg =, he = 1 
Spherical (r,0,¢) hry=1,he=r, hg =rsind 


In any orthogonal coordinate system (u,v) with scale factors hy, 
and h,, an area element has area 


6A = hy hy du dv. 


In any orthogonal coordinate system (u,v, w) with scale factors hy, hy 
and hy, a volume element has volume 


OV = hy hyhy du dv dw. 


A surface can be parametrised by two coordinates (u,v). These are 
related to Cartesian coordinates by equations of the form 


f=—au,8), gy=oyu,2), 2=— e(t,v). 


Using these equations, we define the Jacobian vector 


i j k 
Ox Oy Oz 
J= du du dul; 
Ox Oy Oz 
Ov dav Ov 


which is perpendicular to the surface. 


Unit 9 summary 


Then a surface area element has area 
6A = |J| du dv, 
where |J| is the magnitude of J. 


For a surface region S with wy <u < ug and vy <v < vg, where the 
limits of w and wv are all constants, the surface integral of a function 


f(u, v) is 


i fdA= | - ( / _ Push du) es 


For f = 1, this gives the area of the surface region S. If the surface S' 
is part of the surface of the sphere of radius R centred at the origin, 
and (u,v) are the spherical coordinates (0, ), then J = R’sin@e,. If 
S' is part of a cylinder of radius R with axis along the z-axis, and 
(u,v) are the cylindrical coordinates (@, z), then J = Re,. 


Unit 9 Differentiating scalar and vector fields 


L. 


A field is a quantity with definite values at points throughout a region 
of space. Scalar fields and vector fields describe the distribution of 
scalar and vector quantities, respectively. At any given point: 


e The value of a scalar field is independent of the orientation of the 
coordinate system. 


e The magnitude and direction of a vector field are independent of 
the orientation of the coordinate system. (This implies that the 
components of a vector field depend on the choice of coordinate 
system.) 


In a coordinate system (u,v, w), the unit vector e,, associated with 
the coordinate wu is a vector of unit length pointing in the direction in 
which w increases while the other coordinates v and w are held 
constant. In orthogonal coordinate systems, the unit vectors are 
mutually orthogonal. They may vary from point to point. 


In Cartesian coordinates (x,y,z), the unit vectors are 
& =i, €=Jj, =k, 
In polar coordinates (r,@), the unit vectors are 
e, =cosdi+singj, eg = —sindi+cos¢j. 
In cylindrical coordinates (r,,z), the unit vectors are 
e, =cosdi+singj, eg = —sindi+cos¢j, e, =k. 
In spherical coordinates (r,0,@), the unit vectors are 
e, = sin@cos¢i+sin@sin dj + cos dk, 
eg = cosPcos¢i-+ cos@sin dj — sin#k, 
eg = —singi+ cos@j. 
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To convert a vector field F from Cartesian coordinates into another 
orthogonal system (u,v, w), for each component do the following. 


(a) Write down 
Fy, =e: F, 
and expand the scalar product on the right-hand side using 
Cartesian expressions for e,, and F (involving i, j and k). 
(b) The resulting expression generally depends on (z, y, z). Use the 
coordinate transformation equations in the form 
f= 2(4,0,w), youE,v.,0), 2— 24, v,w); 
to obtain an expression for Fi, solely in terms of wu, v and w. 
Once this has been done for each component, write down 
F = Fie, + Fy ey + Fw ew. 


Given a scalar field V(x, y, z), the corresponding gradient vector 
field is 
OV OV OV 
VV = —i+—j+—k. 
Ox Po Oy ee Oz 


This is also called the gradient of V. 
At each point, the gradient has the following properties: 


e = Its direction is that in which V increases most rapidly. This 
direction is perpendicular to the contour surfaces of V. 


e Its magnitude is the maximum rate of increase of V with respect 
to distance travelled in three-dimensional space. 


e The gradient of a scalar field is a vector field, so its magnitude 
and direction are independent of the orientation of the coordinate 
system. 


e Fora small displacement ds = 6x1 + dyj + 6zk, the corresponding 
small change in V is 


6V = VV - os. 
e The rate of change of V with distance in the direction of the unit 
vector n is equal to the component of VV in the direction of n: 
rate of change of V =n- VV. 


In polar coordinates, 
OV 10V 
VV=— — — e€y. 
Or . r Od “ 
In cylindrical coordinates, 


OV 1 OV OV 


We op poe oe 
In spherical coordinates, 
OV 1 OV 1 OV 
v= On ot 99 8 t rind Ob °* 


10. 


Unit 9 summary 


Given a vector field F = F, i+ Fy j+ F.k, the corresponding 
divergence is 

OF, . Of, . OF, 

dr Oy Oz- 

The divergence of a vector field F is a scalar field. It describes the 
extent to which F flows outwards or diverges from each point. 


V:F= 


The alternative notation div F is 
often used. 


In polar coordinates, 
_ 100°) | 1 aFs 
Or r Ob- 
In cylindrical coordinates, 
1Orr,) . 1OFs oF, 
r Or r Oo Oz 
In spherical coordinates, 
1 O(r?F,) 1 O(sinOFy) 1 OF, 
r2 Or a ee ae rsind Od 
Given a vector field F = F, i+ Fy j+ F, k expressed in a right-handed 
Cartesian coordinate system, the corresponding curl is 


V-F= 


V-F= 


i j k 
O O O 
VxF= Se Be Bel’ The alternative notation curl F 
is Y = is often used. 
Ba Py # 


In the determinant, the partial derivative operators in the second row 
act on the components in the third row. Expanding out the 
determinant gives 


Oy Oz Ox Oz Ou Oy 
The curl of a vector field F is another vector field. It describes the 
extent to which F rotates or swirls locally about each point. (The 
sense of the rotation is given by the right-hand rule.) 


In polar coordinates, 
1 (O(rFy) OF, 
F=- - ofa 
V XxX ( a, ab e 


* 
In cylindrical coordinates, 


e Treg e, 


O ) O 
VRE olor te De 
FE, rF, F, 


In spherical coordinates, 
er reg rsinfeg 
_ 1 joa a 
~ r2sind|Or 00 relo) 
F, Fo rsind Fy 


VxF 
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This formula can be extended in 
an obvious way for paths in 
three-dimensional space. 
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Integrating scalar and vector fields 


A path is a curve with a definite sense of progression from a start 
point to an end point. If the start and end points are identical, then 
the path is closed; otherwise, it is open. 


A path can be represented by a set of parametric equations of the 
form 


saa, vert) e220) Geist), 


where x, y and z are Cartesian coordinates of points on the path, and 
the parameter t increases monotonically, from ¢t = t; at the start point 
to t = t) at the end point. For a path in the ry-plane, we need only 
the parametric equations for x and y. 


A path in three-dimensional space, with parametric equations 
r= x(t), y = y(t), z= 2(t) (ty <t< to), 
has length 


If the path is restricted to the xy-plane, then this simplifies to 


te dx\? dy 
v= [VGae) + Ge 
ts dt dt 
The line integral of a scalar function \(z,y) along a path with 
parametric equations « = x(t), y = y(t) for t) < t < te is given by 
te dx \? dy . 
M= A(a(t t — —) dt. 
[* neato.) (GE) + (GB) 


If A(x, y) is the mass per unit length along the path, then the line 
integral gives the total mass along the path. 


To calculate the line integral of a vector field 
F = F,i+ Fyj+ Fk 
along a path C with the parametric representation 
c=x(t), y=y(t), z=2(t) (i <t<ty), 
do the following. 
(a) Use the parametric representation to find 
oa Sit DiS 
(b) Express the components of F as functions of the parameter t. 
(c) Find the scalar product 
ds dx dy dz 


Be a Oe ae 


as a function of t. 


(d) Evaluate the line integral as a definite integral over t: 


te 
[Po 7 
C iy dt 


A vector field F is called a gradient field if it can be expressed in the 
form 


F =—VU, 
where U is a scalar field, known as the scalar potential field 
associated with F. 
Line integrals of gradient fields have the following properties: 


e Any line integral of a gradient field is path-independent (i.e. it 
does not depend on the detailed path between given start and end 
points). 

e =©Any line integral of a gradient field around a closed loop is equal 
to zero. 


A vector field F is said to be conservative if, throughout its domain 
of definition, all its line integrals are path-independent. 


The line integral of a conservative field F along a path C from a start 
point A to an end point B may be written as 


| F - ds, 
A+B 


as there is no need to indicate the precise path C. 


For a conservative field F with an associated scalar potential 


field U(r), 


| FS te)— Ue). 


All gradient fields are conservative, and all conservative fields are 
gradient fields, so the terms conservative field and gradient field are 
synonymous, and can be used interchangeably. 


Given a conservative vector field F, the corresponding scalar potential 
field is defined as follows. We choose a point ro as the position of zero 
scalar potential; by definition, U(ro) = 0. At any point r, the scalar 
potential field is then given by the line integral 


V(t) = -{ F -ds, 
ro7r 


where the path may be taken to be a straight line from rp to r, or any 
other choice that simplifies the integration. A useful check is that U(r) 
should satisfy 


F =—VU. 


Unit 10 summary 
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In this module all regions are 
simple. 
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An oriented area 
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10. 


11. 


The curl test is used to determine whether a given vector field F is 
conservative in a simple region (such as R?). The vector field F is 
conservative if and only if V x F = 0 throughout the region. 


If n is a unit vector perpendicular to a tiny planar surface element of 
area 6S, then the oriented area of the element is 


6S = 6S n. 


Given a vector field F and a planar element with oriented area 6S, the 
flux of the vector field over the element is defined as 


flux = F - 6S, 


where the field F is evaluated at the position of the element. This is 

the normal component of the field (i.e. the component in the direction 
of the unit normal) multiplied by the area of the planar element. Flux 
is a scalar quantity that can be positive, negative or zero depending on 
the relative orientations of F and the unit normal n. 


A closed surface is one that divides space into two regions — an 
exterior space and an interior space. An open surface is one that is 
not closed. When a surface is approximated by many small planar 
elements, neighbouring elements are chosen to have unit normals that 
are almost parallel (rather than almost antiparallel). For any closed 
surface, all the unit normals are chosen to point outwards into the 
exterior space, rather than inwards towards the enclosed volume. 


Suppose that a surface S is parametrised by (u,v), for uj <u < ug 
and vy <v < v2, where the minimum and maximum values of the 
parameters are constants. Then the flux of a vector field F over a 
surface S' is given by the surface integral 


Vv=v2 u=u2 
ux =f -as= | (/ F-Jdu) db, 
Ss V=U1 U=U1 


where 
i j k 
Ox Oy Oz 
J=jdu du du 
Ox Oy Oz 
dv Ov av 


is the Jacobian vector in item 13 for Unit 8. To evaluate the surface 
integral, the integrand F - J must be expressed in terms of the 
parameters wu and v. 


On the surface of a sphere of radius R, centred on the origin and 
parametrised by (0, ¢) of spherical coordinates, 


J = R’sinée,, 
where 
e, = sin@cos¢i+ sin dsin dj + cosdk 


is the radial unit vector of spherical coordinates. 


1. 


13. 


14. 


15. 


16. 


Le. 


Unit 10 summary 


The divergence of a vector field F at a given point is related to the 
flux of F over a tiny surface enclosing the point. In the limit where 
the surface area and its enclosed volume shrink to zero, 


flux of F over surface 
V °F aoaoO _  — —_. 
volume enclosed by surface 
So the divergence of a vector field at any point can be interpreted as 


the flux per unit volume at that point. 


Given a vector field F and a closed surface S enclosing a volume V, 
the divergence theorem states that 


[F-«s= V-Fdv. 
Ss V 


This is also called Gauss’s theorem. 


The unit normal of a planar element and the sense of positive 
progression around the perimeter of the element are related by the 
right-hand grip rule: with the thumb of your right hand pointing in 
the direction of the unit normal of a planar element, the curled fingers 
of your right hand indicate the sense of positive progression around 
the perimeter of the element. 


Given a vector field F and a closed path C, the circulation of F 


around C is given by the line integral The right-hand grip rule 


circulation = ¢ F - ds. 
C 


The circle on the integral sign is optional: it is used to indicate that 
the path is closed. If C is a closed path around a planar element with 
a given unit normal, then it is understood that C' is traversed in the 
positive sense determined by the right-hand grip rule. 
Given a vector field F in the vicinity of a given point, the component 
of V X F in the direction of the unit vector n can be found by taking 
a planar element with unit normal fi at the point. In the limit where 
the element becomes very small, 

(V x F) a= circulation around perimeter of element 

area of element 

So each component of the curl at a given point can be interpreted as a 
circulation per unit area at that point. 


If F is a vector field and S is an open surface with perimeter C’, then 
the curl theorem states that 


freas= [(vxF)-as, 


where it is assumed that C is traversed in the sense determined by the 
right-hand grip rule. This result is also called Stokes’s theorem. 
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Unit 11 Fourier series 


if, 


A function f(t) is periodic if there is some positive number \ such 
that f(t+ A) = f(t) for all t. The number ) is a period of the 
function f. If A is a period of a function, then so is nA for any positive 
integer n. The fundamental period of a periodic function is the 
smallest (positive) value for the period. The symbol 7 is often used for 
the fundamental period. Unless it is specified otherwise, assume that 
‘period’ means ‘fundamental period’. A fundamental interval for a 
periodic function is any interval whose length is the fundamental 
period. 


The function f(t) = cos(wt) is periodic with fundamental period 27/w. 
The constant w is called the angular frequency. 


A function f(t) is even if 
f(—t) = f(t) for all values of t. 
For any integer k, cos(kt) is even. 
A function f(t) is odd if 
f(—t) =—f(€) for all values of t. 
For any (non-zero) integer k, sin(kt) is odd. 


Even and odd functions combine under addition and multiplication as 
follows. 


e The sum of two even functions is even. 

e The sum of two odd functions is odd. 

e The product of two even functions is even. 

e The product of two odd functions is even. 

e The product of an even and an odd function is odd. 


The integrals of even and odd functions can be simplified when the 
range of integration is symmetric. 


e If f(t) is even, then 
f(t) dt =a f(@) dt. 
—a 0) 
e If f(t) is odd, then 


" f(t)dt =0. 


—-a 


6. 


A Fourier series is an infinite series of trigonometric functions of the 
form 


21 4 
Ay + Ay 00s ( =x) + A.cos ( a) + 
T T 
Qrt Art 
+ Bysin (= *) 4 Bysin (=) po, 
= 


where the constants Ag, A;, Ao,...,B,, Bo,... are called the Fourier 
coefficients, and 7 > 0. 


A Fourier series defines a periodic function whose fundamental period 
is T. 

To find the Fourier series for a periodic function f(t), do the 
following. 

(a) Find the fundamental period r. 


(b) Write down the Fourier series 


= Qnat = Qnit 
F(t) = Ap + y Ancas( - )+ yo Basin (AT). 
n= n= 


where Ap and the A, and B,, are coefficients to be determined. 
Simplify the arguments of the sines and cosines where possible. 


(c) Use the following formulas to determine the Fourier coefficients: 


1 7/2 
Ap = - S(t) dt 
a —7/2 
nia 2nat 
An==f s(deos(7™) ar (w= 1.2...) 
Oe ae 2n7 
Bn=- ni Deb 
= [ssn (A - ma (n= pele 


(d) If desired, express the final Fourier series in a compact form with 
general formulas for its coefficients. 


To find the Fourier series for an odd periodic function f(t), do 
the following. 


(a) Identify f(t) as being odd, and find its fundamental period rT. 
(b) Write down the Fourier series 
= 2n1t 
(t) 2 sin ( - ) 


(c) Find the coefficients by evaluating the definite integrals 


=!" 1ose(2a 


(d) If desired, express the final Fourier series in a compact form with 
general formulas for its coefficients. 
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To find the Fourier series for an even periodic function f(t), do 
the following. 


(a) Identify f(t) as being even, and find its fundamental period r. 


(b) Write down the Fourier series 


. 2 
F(t) = Ag+ > An COs (=) : 


n=1 


(c) Find the coefficients by evaluating the definite integrals 


D) 7/2 
Ae [ f(t) dt, 
T JO 


7/2 
Ay, = - | f(t) cos (==) dt. 
F JO T 


(d) If desired, express the final Fourier series in a compact form with 
general formulas for its coefficients. 


If f(t) is a periodic function, then any interval of length equal to its 
fundamental period 7 can be used to evaluate the Fourier coefficients. 


The following identities are often useful when evaluating Fourier 
series: if n is an integer, then 

cos(n7) = (—1)”, 

sin(n7) = 0, 


_14)n/2 
_ (=) = { (-1) for n even, 


2 0 for n odd, 
, (=) _ J 0 for n even, 
SND) ) (-1)43)/2- for n odd. 


The following integrals may be useful for evaluating Fourier 
coefficients: 


[ tsin(at) dt = 5 [sin(at) — at cos(at)] , 


[ teos(at) a= 5 [cos(at) + at sin(at)]. 


The even extension feyen(t) of a function f(t) defined on an interval 
[0, Z’] is the even periodic function with period 2T and fundamental 
interval |[—-T, T] defined by 


_ff@® for0<t<T, 
Feven(t) = ae for -T <t <0, 


Jown(é = 2T) = Seven) 


14. 


15. 


16. 


The odd extension foqq(t) of a function f(t) defined on an interval 
(0, 7] is the odd periodic function with period 2T and fundamental 
interval [(—T, 7] defined by 


fie for0<t<T, 
foaal) = { —f(-t) for -T. <t <0, 


foaa(t + 2T) = foaa(t). 


If f(t) is a periodic function with Fourier series F(t), then F 
converges to f everywhere if f is continuous. If f(t) has a 
discontinuity at t = to, then 


F(to) = alf (tg) + f(t], 
where f(t7) is the limit of f(¢) as t approaches to from above, and 
f (to ) is the limit of f(t) as t approaches to from below. 


If a continuous periodic function f(t) with fundamental period 7 has 
Fourier series F(t), then its derivative f’(t) has the same fundamental 
period 7, and its Fourier series is given by F’(t). 


Unit 12 Partial differential equations 


1. 


A partial differential equation is an equation relating a dependent 
variable and two or more independent variables through the partial 
derivatives of the dependent variable. The order of a partial 
differential equation is the order of the highest derivative that occurs 
init. A partial differential equation is homogeneous if there are no 
terms that are solely functions of the independent variables. It is 
linear if the terms that contain the dependent variable are 
proportional to the dependent variable or to one of its partial 
derivatives. 


Examples of partial differential equations are the wave equation and 
the diffusion equation. 


The one-dimensional form of the wave equation is 
1 Pu _ Pu 
eof x7’ 
where c is a positive constant called the wave speed. 
The one-dimensional diffusion equation is 
Ou ru 


Ot «Ox? 


where D is a positive constant called the diffusion coefficient. 


The diffusion equation is also known as the heat equation. 
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Homogeneous linear partial differential equations satisfy the principle 
of superposition: if ui(,t) and u2(x,t) both satisfy a homogeneous 
linear partial differential equation, then any linear combination 


u(x,t) = ay ui(a, t) + ag ua(z, t) 


(where a; and ag are constants) also satisfies the partial differential 
equation. 


In order to find a unique solution of a partial differential equation for 
u(x,t), we must specify boundary conditions and initial conditions. 


A boundary condition is an equation relating to a specific value of 
x that holds for all time. For example, if an end of a taut string is 
held at a fixed position or a rod is held at a fixed temperature, then 
the boundary condition is u(L,t) = 0 for all t. If the end of a rod is 
thermally isolated, then the boundary condition is 


a 4s = ior all ¢, 
Ox 


An initial condition is an equation that relates to a specific value 

of t (usually t = 0) and specifies u(x,t) at this time. For example, if 
the initial shape of a taut string (or the initial temperature profile of a 
rod) is given by f(x) for 0 < a < L, then the initial condition is 
u(x,0) = f(x) for 0 <a < L. If the initial velocity profile of a taut 
string is g(x) for 0 < a < ZL, then the initial condition is 


—(x#,0)=g9(a) for0O<a<L. 


To use the method of separation of variables to solve a linear 
homogeneous partial differential equation with dependent variable u 
and independent variables x and t, subject to given boundary and 
initial conditions, do the following. 


(a) Write the unknown function u(x,t) as a product of functions of 
one variable: 


“u(x, t) = Xe) TE). 


Find the required partial derivatives of u in terms of the ordinary 
derivatives of the functions X and 7’. For example, 
0?u/Ox? = X" (x) T(t). 

(b) Substitute the partial derivatives found in step (a) into the partial 
differential equation. Rearrange the equation so that each side 
consists of a function of a single independent variable. Equate 
each side of the rearranged equation to the same separation 
constant j, and hence obtain ordinary differential equations for 
X and T. 

For example, the wave equation separates to give the equations 
X" (zx) 1 T(t) _ 


X(z) © TQ) 


(d) 


(e) 


Use the given boundary conditions for u to find boundary 
conditions for X. 


For example, the boundary condition u(0,t) = 0 for all ¢ will give 
the boundary condition X(0) = 0, and the boundary condition 
(Ou/Ox)(L,t) =0 will give the boundary condition X’(L) = 0. 
Solve the differential equation for X, and apply the boundary 
conditions. Consider different choices for the separation 

constant yu. (Typically, the solutions X(x) take a different form 
depending on whether the separation constant is positive, 
negative or zero.) The boundary conditions generally produce a 
discrete set of solutions X,,(a), which are called eigenfunctions, 
and a corresponding discrete set of values y,, for the separation 
constant, which are called eigenvalues. 


For each allowed y,,, determine the corresponding solution T;,(t) 
of the differential equation for T(t). 


Combine X,,(x) and T,,(t) to obtain a family of product solutions 
une, t) = Xy(e)T,(), m= 1,2, 3).0<2 


Express the general solution as an infinite linear combination of 
these product solutions: 


win, t) = ‘s is, Uy Th, te) 
n=1 


Use the initial conditions and results about Fourier series to 
determine (when possible) the coefficients an. 


For example, the general solution for a taut string released from 
rest is 


CO 
2S hes (=) aie ni7ct 

7 EL Gg }* 
n=1 


and if the initial profile of the string is given by the initial 
condition u(2,0) = f(x) for 0 < a < L, then the coefficients A, 
are given by the coefficients of the odd periodic extension of f(x), 
namely 


2 f . nna 
An == f sin (“) f(a) dr, a en 


The general solution of the equation X"(x) =  X (x) splits into three 
cases depending on whether ju is positive, negative or zero (all cases 
have two constants A and B). 


If y is positive, then ps = c? for some c > 0, and the general 
solution is X(#) = Aexp(cx) + Bexp(—cz). 


If is negative, then y. = —k? for some k > 0, and the general 
solution is X(x) = Asin(ka) + Bcos(kz). 
If 4 is zero, then the general solution is X(x) = Ax + B. 
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Unit 13. Non-linear differential equations 


I. 


A non-linear differential equation, or system of differential 
equations, is one that is not linear in at least one of the dependent 
variables. An autonomous differential equation, or system of 
differential equations, is one where the independent variable does not 
appear explicitly. 


This unit is mainly concerned with systems of two coupled 
autonomous differential equations, of the form 

dx dy 

—=u(2, —=v(z,y). 
The system is non-linear if either u or v is a non-linear function. 
For systems of two coupled differential equations, the solution at 
time t can be represented by a point x in a two-dimensional space 
called phase space, with coordinates z, y. 


The phase point x has equation of motion 
x= u(x,y), 
where u(x, y) is a vector field 


u=([u(z,y) v(z,y)]". 


A particular solution [x(t) y(t)]” of the equation of motion 
determines a path in the ry-plane, parametrised by t, whose tangent 
vector at any point (x,y) on it is the vector u(z,y). Such a solution 
curve is called a phase path (or phase trajectory). The xy-plane 
containing the solution curves is called the phase plane, and a 
diagram showing the phase paths is called a phase diagram. 

An equilibrium point (or fixed point) of a system of differential 
equations [¢ y]7 = u(z,y) is a point (xe, ye) such that x(t) = xe, 
y(t) = Ye is a constant solution of the system of differential equations; 
that is, z(t) = 0 and y(t) = 0 at the point (e, ye). 


To find the equilibrium points of x = u(x, y) for a given vector field u, 
solve the equation 


u(z,y) =0 
for x and y. 


An equilibrium point of a system of differential equations is said to be 
stable when all points in the neighbourhood of the equilibrium point 
remain in the neighbourhood of the equilibrium point as time 
increases, and unstable otherwise. 


The matrix 


Ou Ou 
By (tr) By Py) 
J(x,y) = be Ay 


Bq (tr) By mY) 


is the Jacobian matrix of the vector field u=[u v]". 


Unit 13 summary 


Suppose that the system of differential equations x = u(x) has an 
equilibrium point at 7 = Ze, y = Ye. To linearise the system near this 
equilibrium point, do the following. 


(a) Find the Jacobian matrix J(z, y). 


(b) In the neighbourhood of the equilibrium point, the differential 
equations can be approximated by the linearised form 


where a(t) = a+ p(t) and y(t) = ye + q(t). The vector [p q]" is 
the perturbation from the equilibrium point. 


Consider a linear system of differential equations p = Ap. This There are special cases not 
system has a single equilibrium point, at the origin. The nature of the considered in this module (such 
equilibrium point is determined by the eigenvalues and eigenvectors of 28 When an eigenvalue is zero). 
the matrix A, as indicated by the following table. 


Real eigenvalues, ‘. 
distinct P P aN D 


Both positive: source Both negative: sink Differing signs: saddle 


q 
Real eigenvalues, 
equal P 


Positive: star source 


Negative: star sink 


qd A 
qh qd 
Complex z : , 
eigenvalues D Pp Pp 
Positive real part: Negative real part: Pure imaginary: 
spiral source spiral sink centre 
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10. 


11. 


12. 


The equilibrium point is stable if it is a sink, a star sink, a spiral sink 
or a centre, and unstable if it is a source, a star source, a spiral 
source or a saddle point. 


To classify the equilibrium points of the non-linear system of 
differential equations x = u(x, y), do the following. 


(a) Find the equilibrium points using the method described in item 3. 


(b) Use the procedure described in item 6 to find the linear system 
that approximates the non-linear system in the neighbourhood of 
each equilibrium point. 


(c) Use the table in item 7 to classify the linear system, with the 
matrix A in item 7 equal to the linear approximation J(e, Ye) 
found in item 6. 


The behaviour of a system of non-linear differential equations near an 
equilibrium point is the same as the behaviour of the linear 
approximation in the neighbourhood, except when the linear system 
has a centre. If the linear system has a centre, then the equilibrium 
point of the original non-linear system may be a stable centre, a stable 
spiral sink or an unstable spiral source. 


Some equations of motion have a constant of motion K(z,y). This 
function has the property that K (a(t), y(t)) remains constant along 
any given phase path (x(t), y(t)). This implies that K satisfies a 
differential equation 


OK dx 2 OK dy _ 
Ox dt Oy dt — 
When there is a constant of motion, the phase paths must be contours 


of K(a,y). If the initial condition lies on a closed contour of K(x, y), 
then the motion will be periodic in t. 


0. 


An important application is population dynamics, where x(t) and 
y(t) are populations of two interacting species (usually predator and 
prey). The Lotka—Volterra equations are an important model: 


u(t), 4--w(-$) 


where k, h, X and Y are positive constants. 


The equilibrium points of the Lotka—Volterra equations are an 
unstable saddle at (0,0) and a stable centre at (X,Y). 


The Lotka—Volterra equations have a constant of motion 
K(a,y) =hinx+klny - a — Lay 
xX ¥ 
The contours are all closed curves (except for the x- and y-axes). 


A second-order differential equation 7 = f(x, #) can be converted into 
a pair of simultaneous first-order equations by setting y = «; the 
equivalent pair is 


L=yY, y = f(x,y). 


13. 


14. 


The undamped pendulum satisfies the differential equations 

g=y, y= —w'sing (-1<a2<7). 
This system has two equilibrium points: a stable centre at (0,0) and 
an unstable saddle at (7,0). 
The damped pendulum satisfies the differential equations 

E=y, y= —w'sing—ey (-—tr<a<n), 
where ¢€ is a positive constant. This system has two equilibrium points, 
at (0,0) and (z,0). The equilibrium point (0,0) is a stable spiral sink 
for 0 < € < 2w, a stable improper sink for ¢ = 2w, and a stable sink for 
€ > 2w. The equilibrium point (7,0) is an unstable saddle. 
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angle 
between vectors 34 
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back substitution 42 
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boundary condition 32, 


boundary value 32 
boundary-value problem 
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Cartesian components of a vector 33 


Cartesian coordinates 
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Cartesian unit vector 33, 40 
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chain rule of differentiation 20 
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circle 18 
circulation 63 
circumference of a circle 
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complementary function 
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explicit solution of a differential equation 27 
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factorial 23 
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first-order ordinary differential equation 27 
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Gauss’s theorem 63 

general solution 
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by substitution 25 

rules 25 
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